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Preface 


The purpose of this book is to isolate and draw attention to the most 
important problem-solving techniques typically encountered in undergradu¬ 
ate mathematics and to illustrate their use by interesting examples and 
problems not easily found in other sources. Each section features a single 
idea, the power and versatility of which is demonstrated in the examples 
and reinforced in the problems. The book serves as an introduction and 
guide to the problems literature (e.g., as found in the problems sections of 
undergraduate mathematics journals) and as an easily accessed reference of 
essential knowledge for students and teachers of mathematics. 

The book is both an anthology of problems and a manual of instruction. 
It contains over 700 problems, over one-third of which are worked in detail. 
Each problem is chosen for its natural appeal and beauty, but primarily for 
its unique challenge. Each is included to provide the context for illustrating 
a given problem-solving method. The aim throughout is to show how a 
basic set of simple techniques can be applied in diverse ways to solve an 
enormous variety of problems. Whenever possible, problems within sections 
are chosen to cut across expected course boundaries and to thereby 
strengthen the evidence that a single intuition is capable of broad applica¬ 
tion. Each section concludes with “Additional Examples” that point to 
other contexts where the technique is appropriate. 

The book is written at the upper undergraduate level. It assumes a 
rudimentary knowledge of combinatorics, number theory, algebra, analysis, 
and geometry. Much of the content is accessible to students with only a 
year of calculus, and a sizable proportion does not even require this. 
However, most of the problems are at a level slightly beyond the usual 
contents of textbooks. Thus, the material is especially appropriate for 
students preparing for mathematical competitions. 
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Preface 


The methods and problems featured in this book are drawn from my 
experience of solving problems at this level. Each new issue of The 
American Mathematical Monthly (and other undergraduate journals) con¬ 
tains materia] that would be just right for inclusion. Because these ideas 
continue to find new expression, the reader should regard this collection as 
a starter set and should be encouraged to create a personal file of problems 
and solutions to extend this beginning in both breadth and depth. Obvi¬ 
ously, we can never hope to develop a “system” for problem-solving; 
however, the acquiring of ideas is a valuable experience at alt stages of 
development. 

Many of the problems in this book are old and proper referencing is very 
difficult. I have given sources for those problems that have appeared more 
recently in the literature, citing contests whenever possible. I would appreci¬ 
ate receiving exact references for those I have not mentioned. 

f wish to take this opportunity to express my thanks to colleagues and 
students who have shared many hours of enjoyment working on these 
problems. In this regard I am particularly grateful to O. E. Stanaitis, 
Professor Emeritus of St. Olaf College. Thanks to St. Olaf College and the 
Mellon Foundation for providing two summer grants to help support the 
writing of this manuscript. Finally, thanks to all individuals who contrib¬ 
uted by posing problems and sharing solutions. Special acknowledgement 
goes to Murray s. Klamkin who for over a quarter of a century has stood 
as a giant in the area of problem-solving and from whose problems and 
solutions I have learned a great deal. 


March 21, 1983 


Loren C. Larson 
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Chapter 1. Heuristics 


Strategy or tactics in problem-solving is called heuristics. In this chapter we 
will be concerned with the heuristics of solving mathematical problems. 
Those who have thought about heuristics have described a number of basic 
ideas that are typically useful. Among these, we shall focus on the follow¬ 
ing: 

(1) Search for a pattern. 

(2) Draw a figure. 

(3) Formulate an equivalent problem. 

(4) Modify the problem. 

(5) Choose effective notation. 

(6) Exploit symmetry. 

(7) Divide into cases. 

(8) Work backward. 

(9) Argue by contradiction. 

(10) Pursue parity. 

(11) Consider extreme cases. 

(12) Generalize. 

Our interest in this list of problem-solving ideas is not in their descrip¬ 
tion but in their implementation. By looking at examples of how others 
have used these simple but powerful ideas, we can expect to improve our 
problem-solving skills. 

Before beginning, a word of advice about the problems at the end of the 
sections: Do not be overly concerned about using the heuristic treated in 
that section. Although the problems are chosen to give practice in the use 
of the heuristic, a narrow focus may be psychologically debilitating.' A 
single problem usually admits several solutions, often employing quite 


1 
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different heuristics. Therefore, it is best to approach each problem with an 
open mind rather than with a preconceived notion about how a particular 
heuristic should be applied. In working on a problem, solving it is what 
matters. It is the accumulated experience of all the ideas working together 
that will result in a heightened awareness of the possibilities in a problem. 


1.1. Search for a Pattern 

Virtually all problem solvers begin their analysis by getting a feel for the 
problem, by convincing themselves of the plausibility of the result. This is 
best done by examining the most immediate special cases; when this 
exploration is undertaken in a systematic way, patterns may emerge that 
will suggest ideas for proceeding with the problem. 

1.1.1. Prove that a set of n (different) elements has exactly 2" (different) 
subsets. 


When the problem is set in this imperative form, a beginner may panic 
and not know how to proceed. Suppose, however, that the problem were 
cast as a query, such as 

(i) How many subsets can be formed from a set of n objects? 

(ii) Prove or disprove: A set with n elements has 2" subsets. 

In either of these forms there is already the implicit suggestion that one 
should begin by checking out a few special cases. This is how each problem 
should be approached: remain skeptical of the result until convinced. 


Solution 1 . We begin by examining what happens when the set contains 
0,1,2,3 elements; the results are shown in the following table: 


Elements Number of 

n of S Subsets of S subsets of S 


0 none 0 1 

1 x, 0, {xj 2 

2 x„x 2 0 , {*,}, {* 2 }. 4 

3 x it x 2 ,xj 0, {*[}, {x 2 }, {x„x 2 } 8 


{* 3 }- {-* 2 .* 3 ). 


Our purpose in constructing this table is not only to verify the result, but 
also to look for patterns that might suggest how to proceed in the general 
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case. Thus, we aim to be as systematic as possible. In this case, notice when 
n = 3, we have listed first the subsets of {*,, x 2 ] and then, in the second 
line, each of these subsets augmented by the element x 3 . This is the key 
idea that allows us to proceed to higher values of n. For example, when 
n = 4, the subsets of S = {Xj,x 2 ,x 3 ,x 4 } are the eight subsets of {x u x 2 ,x 3 } 
(shown in the table) together with the eight formed by adjoining x 4 to each 
of these. These sixteen subsets constitute the entire collection of possibili¬ 
ties; thus, a set with 4 elements has 2 4 ( = 16) subsets. 

A proof based on this idea is an easy application of mathematical 
induction (see Section 2.1). 

Solution 2. Another way to present the idea of the last solution is to argue 
as follows. For each n, let A n denote the number of (different) subsets of a 
set with n (different) elements. Let S be a set with n + 1 elements, and 
designate one of its elements by x. There is a one-to-one correspondence 
between those subsets of S which do not contain x and those subsets that 
do contain x (namely, a subset T of the former type corresponds to 
ru(r)). The former types are all subsets of S — {x}, a set with n 
elements, and therefore, it must be the case that 

+1 = 2A„. 

This recurrence relation, true for n = 0,1,2,3,. .. , combined with the fact 
that A 0 = 1, implies that A„ = 2". (A n = 2A n ^ l = 2 2 A n _ 2 = = 2"A 0 

= 2 fl .) 

Solution 3. Another systematic enumeration of subsets can be carried out 
by constructing a “tree”. For the case « = 3 and S = {a,b,cj, the tree is as 
shown below: 

Subset 
{a,b, c) 

[a, b} 

{a, c} 

{a} 

{b,c} 

{b} 

{c} 

9 

Each branch of the tree corresponds to a distinct subset of S (the bar over 
the name of the element means that it is not included in the set correspond¬ 
ing to that branch). The tree is constructed in three stages, corresponding to 
the three elements of S. Each element of S leads to two possibilities: either 
it is in the subset or it is not, and these choices are represented by two 
branches. As each element is considered, the number of branches doubles. 
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Thus, for a three-clement set, the number of branches is 2 X 2 X 2 = 8. For 
an n-e]ement set the number of branches is 

2 X 2 X • • • X 2 = 2"; 
n 

thus, a set with n elements has 2 " subsets. 

Solution 4. Suppose we enumerate subsets according to their size. For 
example, when S = {a,h,e,d}, the subsets are 


Number of 
elements 


Number of 
subsets 

0 

0 

1 

1 

{«},{&}, {e},{rf} 

4 

2 

{a,b}, {a,c}, {a,d}, {h,c}, {b,d}, {c,rf} 

6 

3 

{a,b,c), { a 7 b,d }, {a,c,d} f {b,c,d} 

4 

4 

{a,b,c,d} 

1 


This beginning could prompt the following argument. Let S be a set with 
n elements. Then 

n 

No. of subsets of S' = 2 (No. of subsets of S with k elements) 

it = 0 



The final step in this chain of equalities follows from the binomial theorem, 

i o ( n k yy- k ’ 

upon setting x = 1 and _y = 1 . 

Solution 5. Another systematic beginning is illustrated in Table 1.1, which 
lists the subsets of S = {jc l3 jc 2 , -^ 3 }- To understand the pattern here, notice 
the correspondence of subscripts in the leftmost column and the occurrence 


Table 1.1 


Subset 

Triple 

Binary number 

Decimal number 

0 

( 0 , 0 , 0 ) 

0 

0 

{* 3 } 

( 0 , 0 , 1 ) 

1 

1 

{* 2 } 

( 0 , 1 , 0 ) 

10 

2 

{* 2 , * 3 } 

( 0 , 1 , 1 ) 

11 

3 

(*l) 

( 1 , 0 , 0 ) 

100 

4 


(LO, 1 ) 

101 

5 


( 1 , 1 , 0 ) 

no 

6 


( 1 , 1 , 1 ) 

111 

7 
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of Ps in the second column of triples. Specifically, if A is a subset of 
S = {x,,jc 2 , .. . , x n }, define a jt for / = 1,2.n, by 

f 1 if a,^A, 

a '~(o if a,&A. 

It is clear that we can now identify a subset A of S with (Oj, a 2 ,. . . , a n ), an 
«-tuple of 0’s and l’s- Conversely, each such n-tuple will correspond to a 
unique subset of S. Thus, the number of subsets of S is equal to the 
number of n-tuples of 0’s and Ps. This latter set is obviously in one-to-one 
correspondence with the set of nonnegative binary numbers less than 2". 
Thus, each nonnegative integer less than 2” corresponds to exactly one 
subset of S, and conversely. Therefore, it must be the case that S has 2" 
subsets. 

Normally, we will give only one solution to each example—a solution 
which serves to illustrate the heuristic under consideration. In this first 
example, however, we simply wanted to reiterate the earlier claim that a 
single problem can usually be worked in a variety of ways. The lesson to be 
learned is that one should remain flexible in the beginning stages of 
problem exploration. If an approach doesn’t seem to lead anywhere, don’t 
despair, but search for a new idea. Don’t get fixated on a single idea until 
you’ve had a chance to think broadly about a variety of alternative 
approaches. 


1.1.2. Let S„o, S„ lt and S„ 2 denote the sum of every third element in the 
nth row of Pascal’s Triangle, beginning on the left with the first element, 
the second element, and the third element respectively. Make a conjecture 
concerning the value of 5 100 l . 


Solution. We begin by examining low-order cases with the hope of finding 
patterns that might generalize. In Table 1.2, the nonunderlined terms are 
those which make up the summands of S n 0 ; the singly underlined and 


Table 1.2 


Pascal’s triangle 

n 

*^B.O 

^J7.l 

S„,2 

1 


r 

0 

0 

1 1 

i 

i 

1 

0 “ 

1 1 I 

2 

i 

2 + 

1 

13 3 1 

3 

2 “ 

3 

3 

14 6 4 1 

4 

5 

5 

6 + 

15 10 10 5 1 

5 

11 

io- 

11 

JUl. 20 J5 6 1 

6 

22 + 

21 

21 

I 7 2) 35 35 21 7 1 

7 

43 

43 

42“ 

—i— -^1 ■ ii? '~5% % —1— 

-S- 

-- 

- 

—- 
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doubly underlined terms are those of S„, and S„ 2 , respectively. The three 
columns on the right show that, in each case, two of the sums are equal, 
whereas the third is either one larger (indicated by a superscript +) or one 
smaller (indicated by a superscript —). It also appears that the unequal 
term in this sequence changes within a cycle of six. Thus, from the pattern 
established in the first rows, we expect the anomaly for n = 8 to occur in 
the middle column and it will be one HSarthan the other two. 

We know that S rt 0 + S nA + S„ a “ 2" (see 1.1.1). Since 100 = 6 X 16+4, 
we expect the unequal term to occur in the third column (S 1 ^ 2 ) and to be 
one more than the other two. Thus S 1000 = S 100 A = S 1002 - 1, and S 100J + 
S,oo.i + S’Loo,, + 1 = 2 J0 °. From these equations we are led to conjecture 
that 



A formal proof of this conjecture is a straightforward application of 
mathematical induction (see Chapter 2). 


1 . 13 . Let *!, jc 2 ,.. . be a sequence of nonzero real numbers satisfying 


2x n _ 2 -x„ 


n = 3,4,5, ... . 


Establish necessary and sufficient conditions on and x 2 for x n to be an 
integer for infinitely many values of n. 


Solution. To get a feel for the sequence, we will compute the first few terms, 
expressing them in terms of x, and x 2 . We have (omitting the algebra) 

- 

Xi 2x, — x 2 ’ 

- Xl * 2 
* 4 3*! - 2x 2 * 

- *1*2 

* 5 4x, — 3x 2 

We are fortunate in this particular instance that the computations are 
manageable and a pattern emerges. An easy induction argument establishes 
that 

= _ *lX 2 

X " («- l)jr I -<n-2)* 2 ’ 

which, on isolating the coefficient of n, takes the form 

* 1*2 


" (*I - * 2 )" + (2*2 - *l) 
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In this form, we see that if Ay =f= x 2 , the denominator will eventually exceed 
the numerator in magnitude, so x„ then will not be an integer. However, if 
jc, = x 2 , all the terms of the sequence are equal. Thus, x n is an integer for 
infinitely many values of n if and only if a, = x 2 . 

1.1.4. Find positive numbers n and a y ,a 2 , .. ., a„ such that a t + ■ ■ ■ + a„ 
= 1000 and the product a l a 2 ■ ■ ■ a n is as large as possible. 


Solution. When a problem involves a parameter which makes the analysis 
complicated, it is often helpful in the discovery stage to replace it temporar¬ 
ily with something more manageable. In this problem, we might begin by 
examining a sequence of special cases obtained by replacing 1000 in turn 

with 2,3,4,5,6,7,8,9.In this way we are led to discover that in a 

maximum product 

(i) no a, will be greater than 4, 

(ii) no a, will equal I, 

(iii) all a/s can be taken to be 2 or 3 (because 4 = 2x2 and 4 = 2 + 2), 

(iv) at most two a/s will equal 2 (because 2x2x2<3x3 and 2 + 2 + 2 
= 3 + 3). 

Each of these is easy to establish. Thus, when the parameter is 1000 as in 
the problem at hand, the maximum product must be 3 332 X 2 2 . 


1.1.5. Let S be a set and * be binary operation on S satisfying the two 
laws 

x*x = X for all x in S, 

(x *y) * z = (y * z) * x for all x,y,z in S. 

Show that x *y = y * x for all x, y in S. 


Solution. The solution, which appears so neatly below, is actually the end 
result of considerable scratch work; the procedure can only be described as 
a search for pattern (the principle pattern is the cyclic nature of the factors 
in the second condition). We have, for all x, y in S, x *y = (x *y) * (x *y) 
= [y*(x*y)]*x = [(x*y)*x]*y = [(y * x) * x] *y = [(x*x)*y]*y 
= [(y*y)J *(-***) = y*x. 


Problems 

Develop a feel for the following problems by searching for patterns. Make 
appropriate conjectures, and think about how the proofs might be carried 
out. 
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1.1.6. Beginning with 2 and 7, the sequence v 2,7 (l 1,4,7,4,2,8-,... is con¬ 
structed by multiplying successive pairs of its members and adjoining the 
result as the next one or two members of the sequence, depending on 
whether the product is a one- or a two-digit number. Prove that the digit 6 
appears an infinite number of times in the sequence. 

1.1.7. Let 5, denote the sequence of positive integers 1,2,3,4,5,6, . . . , 
and define the sequence S w+ , in terms of S„ by adding 1 to those integers in 

S„ which are divisible by n. Thus, for example, S 2 is 2,3,4,5,6,7. S } 

is 3,3,5,5,7,7.Determine those integers n with the property that the 

first n — 1 integers in S„ are n. 

1.1.8. Prove that a list can be made of all the subsets of a finite set in such 
a way that 


(i) the empty set is first in the list, 

(ii) each subset occurs exactly once, and 

(iii) each subset in the list is obtained either by adding one element to the 
preceding subset or by deleting one element of the preceding subset. 


1.1.9. Determine the number of odd binomial coefficients in the expansion 
of (x+_y) 1000 . (See 4.3.5.) 

1 . 1 . 10 . A well-known theorem asserts that a prime p > 2 can be written as 
a sum of two perfect squares (p = m 2 + n 1 , with m and n integers) if and 
only if p is one more than a multiple of 4. Make a conjecture concerning 
which primes p > 2 can be written in each of the following forms, using 
(not necessarily positive) integers x and y: (a) x 1 + 16_y 2 , (b) 4x 2 + 4 xy + 
5 y*. (See 1.5.10.) 


1.1,11. If (a n ) is a sequence such that for n > 1, (2 — «„)£7 n+l = 1, what 
happens to a n as n tends toward infinity? (See 7.6.4.) 


1.1.12. Let S be a set, and let * be a binary operation on S satisfying the 
laws 


x * (x * y) = y for all x, y in S, 

(_y*x)*x=^ for all x,y in S. 
Show that x *y = y * x for all x, y in S. 

Additional Examples 


Q X 




v°la. 

* a 


Most induction problems are based on the discovery of a pattern. Thus, the 
problems in Sections 2.1, 2.2, 2.3, 2.4 offer additional practice in this 
heuristic. Also see 1.7.2, 1.7.7, 1.7.8, 2.5.6, 3.1.1, 3.4.6, 4.3.1, 4.4.1, 4.4.3, 
4.4.15, 4.4.16, 4.4.17. 
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Whenever possible it is helpful to describe a problem pictorially, by means 
of a figure, a diagram, or a graph. A diagrammatic representation usually 
makes it easier to assimilate the relevant data and to notice relationships 
and dependences. 

1.2.1. A chord of constant length slides around in a semicircle. The 
midpoint of the chord and the projections of its ends upon the base form 
the vertices of a triangle. Prove that the triangle is isosceles and never 
changes its shape. 


Solution. Let AB denote the base of the semicircle, let XY be the chord, Af 
the midpoint of XY, C and D the projections of X and Y on AB (Figure 
1.1). Let the projection of M onto AB be denoted by N. Then N is the 
midpoint of CD and it follows that A CMD is isosceles. 

To show that the shape of the triangle is independent of the position of 
the chord, it suffices to show that / MCD remains unchanged, or equiva¬ 
lently, that Z XCM is constant, for all positions of XY. To see that this is 
the case, extend XC to cut the completed circle at Z (Figure 1.2). Then CM 
is parallel to ZY (C and M are the midpoints of XZ and XY, respectively), 


- A 

"i 



% 


x 




z 

Figure 1.2. 
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X 



and consequently LXCM = LXZY. But Z XZY equals one-half the arc 
XY, and this arc depends only on the length of the chord XY. This 
completes the proof. 

One might ask: How in the world did anyone ever think to extend XC in 
this way? This is precisely the step that makes the argument so pretty, and 
it is indeed a very difficult step to motivate. About all that can be said is 
that the use of auxiliary lines and arcs (often found by reflection, extension, 
or rotation) is a common practice in geometry. Just the awareness of this 
fact will add to the possible approaches in a given problem. 

Another interesting approach to this problem is to coordinatize the 
points and to proceed analytically. To show that the shape of the triangle is 
independent of the position of the chord, it suffices to show that the 
height-to-base ratio, MN/ CD, is constant. 

Let O denote the midpoint of AB, and let 0 = Z YOB. It is clear that the 
entire configuration is completely determined by 9 (Figure 1.3). 

Let « =t Z XOY. Using this notation, 

CD = cos0 — cos(0 + a), 
sin# + sin(fl + a) 


0 < 9 < 7T — «. 

It is not immediately clear that this quantity is independent of 9; this is the 
content of 1.8.1 and 6.6.7. 




and the height-base ratio is 

sin# + sin(0 + a) 


F(0) = 


2 (cos# - cos(# + a)) ’ 


1.2.2. A particle moving on a straight line starts from rest and attains a 
velocity » 0 after traversing a distance ,s 0 . If the motion is such that the 
acceleration was never increasing, find the maximum time for the trans¬ 
verse. 


Solution. Focus attention on the graph of the velocity v * c(/) (Figure 1.4). 
We are given that «(0) = 0, and the graph of v is never concave upward 
(because the acceleration, dc/dt, is never increasing). The area under the 
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II 


v 



curve is equal to j 0 (distance traversed = \‘ 0 v{t)dt). From this representa¬ 
tion, it is clear that we will maximize the time of traverse when the curve 
o(/) from 0 to P is a straight line (Figure 1.5). At the maximum time / 0 , 
5 / 0 «o = J 0 ’ or equivalently, ? 0 = 2sJv 0 . 


1.23. If a and b are positive integers with no common factor, show that 



b 


(«-!)(*-!) 

2 


Solution. When b — 1, we will understand that the sum on the left is 0 so 
the result holds. 

It is not clear how a figure could be useful in establishing this purely 
arithmetic identity. Yet, the statement involves two independent variables, 
a and b, and a/b, 2a/b, 7>a/b, . . . are the values of the function 
f{x) = ax/b when x — 1,2,3, .. ., respectively. Is it possible to interpret 
\a/b 1, \2a/b ],. .. geometrically? 

To make things concrete, consider the case a = 5 and b = 7. The points 
P k = {k,5k/l), k — 1,2, . . . , 6, each lie on the line y = 5x/7, and 
15 Ar/7 ] equals the number of lattice points on the vertical line through P k 
which lie above the x-axis and below P k . Thus, 2*-i [ 5A'/7 ] equals the 


V 



Future L5. 
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number of lattice points interior to A ABC (see Figure 1.6). By symmetry, 
this number is one-half the number of lattice points in the interior of 
rectangle A BCD. There are 4 X 6 — 24 lattice points in A BCD, which 
means that triangle ABC contains 12 interior lattice points. 

The same argument goes through in the general case. The condition that 
a and b have no common factor assures us that none of the lattice points in 
the interior of ABCD will fall on the liney = ax/b. Thus, 



i (No. of lattice points in the interior of ABCD ) 


(a-l)(b-\) 

2 


1.2.4 (The handshak e problem). Mr. and Mrs. Adams recently attended 
a party at which there were three other couples. Various handshakes took 
place. No one shook hands with his/her own spouse, no one shook hands 
with the same person twice, and of course, no one shook his/her own hand. 

After all the handshaking was finished, Mr. Adams asked each person, 
including his wife, how many hands he or she had shaken. To his surprise, 
each gave a different answer. How many hands did Mrs. Adams shake? 


Solution. Although a diagram is not essential to the solution, it is helpful to 
view the data graphically in the following fashion. Represent the eight 
individuals by the eight dots as shown in Figure 1.7. 

Now the answers to Mr. Adams’ query must have been the numbers 
0,1,2,3,4,5,6. Therefore, one of the individuals, say A, has shaken hands 
with six others, say B, C,D, E, F, G. Indicate this on the graph by drawing 
line segments from A to these points, as in Figure 1.8. 

From this diagram, we see that H must be that person who has shaken 
no one’s hand. Furthermore, A and H must be spouses, because A has 
shaken hands with six others, not counting his/her own spouse. 



1.2. Draw a Figure 
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E E 

Figure 1.9. Figure 1.10. 


By supposition, one of B, C, D, E, F, G, has shaken five hands. By 
relabeling if necessary we may assume this person is B. Also, we may 
assume without loss of generality that the five with whom B has shaken 
hands are labeled A, C,D,E,F. This is shown in Figure 1.9. From this 
sketch we easily see that G is the only person who could have answered 
“one”, and B and G must be spouses. 

Again, as before, by relabeling the points C, D, E if necessary, we may 
assume that C shook four hands and that they belonged to A,B,D,E. The 
corresponding diagram is given in Figure 1.10. Using the same reasoning as 
above, Band C are spouses, and consequently, D and E are spouses. 

Each of D and E has shaken hands with three others. Since Mr. Adams 
did not receive two “three” answers, D and E must correspond to Mr. and 
Mrs. Adams; that is to say, Mrs. Adams shook hands with three others. 

Problems 

1.2.5. Two poles, with heights a and b, are a distance d apart (along level 
ground). A guy wire stretches from the top of each of them to some point P 
on the ground between them. Where should P be located to minimize the 
total length of the wire? (Hint: Let the poles be erected at points C and D, 
and their tops be labeled A and B, respectively. We wish to minimiz e 
AP + PB. Augment this diagram by reflecting it in the base line CD. 
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Suppose B reflects to B' (PB = PB'). Now the problem is: Where should P 
be located to minimize AP + PB'?) 

1 . 2 . 6 . Let ABC be an acute-angled triangle, and let D be on the interior of 
the segment AB. Locate points E on AC and F on CB such that the 
inscribed triangle DEF will have minimum perimeter. (Hint: Reflect D in 
line AC to a point D'; reflect D in CB to a point D" and consider the line 
segment D'D*.) 

1 . 2 . 7 . A rectangular room measures 30 feet in length and 12 feet in height, 
and the ends are 12 feet in width. A fly, with a broken wing, rests at a point 
one foot down from the ceiling at the middle of one end. A smudge of food 
is located one foot up from the floor at the middle of the other end. The fly 
has just enough energy to walk 40 feet. Show that there is a path along 
which the fly can walk that will enable it to get to the food. 

1 . 2 . 8 . Equilateral triangles ABP and ACQ are constructed externally on 
the sides AB and AC of triangle ABC . Prove that CP = BQ. (Hint: For a 
nice solution, rotate the plane of the triangle 60° about the point A, in a 
direction which takes B in the direction of C. What happens to the line 
segment CP?) 

1.2.9. Let a and b be given positive real numbers with a < b. If two points 

are selected at random from a straight line segment of length b, what is the 
probability that the distance between them is at least a? (Hint: Let x and y 
denote the randomly chosen numbers from the interval [0,6], and consider 
these independent random variables on two separate axes. What area 
corresponds to 1* — > a?) 

1 . 2 . 10 . Give a geometric interpretation to the following problem. Let f be 
differentiable with /' continuous on [a, b\. Show that if there is a number c 
in (a, b\ such that /'(c) = 0, then we can find a number d in (a, b) such that 

b — a 



1.2.11. Let a and b be real numbers, a < b. Indicate geometrically the 
precise location of each of the following numbers: (a + b)/2 (= ]a + tft); 
|a + %a + %b; [m/(m + rt)]a + [n/{m + n)]b, where m>0 and 
n > 0. (The latter number corresponds to the center of gravity of a system 
of two masses—one, of mass m, located at a, and the other, of mass n, 
located at b.) 


1.2.12. Use the graph of y = sin* to show the following. Given triangle 
ABC , 


(a) sin B. + sin C <sinJ g±_C 


(b) —sin 5 + —5— sinC < sinf B + —Clm>0,«> 0. 
y ' m + n m + n \m + » m + n ) 
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1.2.13. Use a diagram (a rectangular array 0,a ; » to show that 

(^) 


<*>) 


E E a i a j S a i a j - 

i-oy-o y=Oi=0 

n n n n 

E E ^ = E E ■ 

j-Oi-j i-Oy-i 


( n \ 2 n n n : n 

E ^ = E E «,«, = 2 E S «;<* - E a ? ■ 

1-0 / 1 = 0y = 0 ( = 0 y = 0 1 = 0 


Additional Examples 

Most of the problems in Chaper 8 (Geometry); also 1.3.11, 1.9.2, 1.9.4, 
1.11.3, 2.1.3, 2.5.5, 2.6.11, 5.1.2, 6.2.2, 6.4.1, 6.6.3, 6.8.1, 7.1.14, 7.4.19, 7.6.1, 
8 . 1 . 1 . 


1.3. Formulate an Equivalent. Problem 


The message of the preceding section is that the first step in problem 
solving is to gather data, to explore, to understand, to relate, to conjecture, 
to analyze. But what happens when it is not possible to do this in a 
meaningful way, either because the computations become too complicated 
or because the problem simply admits no special cases that shed any 
insight? In this section we will consider some problems of this type. The 
recommendation of this section is to try to reformulate the problem into an 
equivalent but simpler form. The appeal is to one’s imagination and 
creativity. Some standard reformulation techniques involve algebraic or 
trigonometric manipulation, substitution or change of variable, use of 
one-to-one correspondence, and reinterpretation in the language of another 
subject (algebra, geometry, analysis, combinatorics, etc.). 


13 . 1 . Find a general formula for the,nth derivation of f(x) = 1/(1 — x 2 ). 


Solution. A common simplifying step when working with rational functions 
is to write the function as a sum of partial fractions. In this case, 


/(*) * A [ ~— + -r- f 1, 

2(1— x 1+x 
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and in this form it is easy to show that 


/ (n> (*) 


«! 

2 


1 


+ 


(- 0 " 

(1 + JC) fl + l 


13.2. Find all solutions of x A + x 3 + x 1 + x + 1 = 0. 


Solution. This equation can be solved by dividing by x 2 , then substituting 
y — x + 1 / x, and then applying the quadratic formula. Thus, we have 

* 2 +-L+*+- + l=0, 

x 2 * 

(^ + 2 + X) + (, + I) + (1 - 2) ,o, 

The roots of this equation are 

-1+V5 — i — Vs" 

7[ ---. , 2 --2-• 

It remains to determine x by solving the two equations 

* + ~=Ti and x+±=y 2 , 

which are equivalent to 

* 2 — /!* + !=() and x 2 — y 2 x + 1 = 0. 

The four roots found by solving these are 


-1+75 . V 10 + 2 ^ 

A ' * A ~ 


_-l+V5 .710 + 275 

' ^ A t A ~ 


-1-75 _ , Vio-27^ 

4 +/ 4 


-1-75 . VlQ — 275 


Another approach to this problem is to multiply each side of the original 
equation by x — 1. Since (jc — IX-* 4 + x 3 + x 2 + x + 1) = * 5 — 1, an equiv- 
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alent problem is to find all x (other than x = 1) which satisfy x 5 = 1. These 
are the five fifth roots of unity, given by 

x } = cos+ /sin|ir, 

x 2 =* cosf it + i'sin|jr, 

x 3 = cosfrr + /sin|w, 

x A = cos | jt + t sin 177, 

*5 = 1 . 

As a by-product of having worked this problem two different ways, we 
see that 


COS 177 + <'sinf7r = 


_ -1 +V5 . V 10 + 2 ^ 

4 + ' '4 


Equating real and imaginary parts yields 

-1 +V5 


cos 72° = 


sin 72° 


VlO + 2VJ 

4 


(Similar formulas can be found for jr 2 , * 3 . and x 4 .) 


133. P is a point inside a given triangle ABC; D, E,F are the feet of the 
perpendiculars from P to the lines BC, CA, AB, respectively. Find all P for 
which 

BC.CA.AB 
PD PE PF 

is minimal. 


Solution. Denote the lengths of BC, AC, AB by a,b,c, respectively, and 
PD, PE, PF by p, q, r, respectively (see Figure 1.11). We wish to minimize 
a/p + b/q + c/r. 


A 



Figure 1.11. 
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Notice that 

Area A ABC = Area A BCP + Area A CA P + Area A ABP 
= i ap + {bq + \cr 

ap + bq + cr 
= 2 — ‘ 

Thus, ap + bq + cr is a constant, independent of the placement of P. 
Therefore, instead of minimizing a/p + b/q + c/r, we will minimize 
(ap + bq + cr%a/p + b/q + cjf). (This step will appear more natural 
after a study of inequalities with constraints taken up in Section 7.3.) We 
have 

(qp + ^ + ci-^-a + l + ^J 

= a 1 + b 1 + c 1 + «£>( ^ ~ j + f' + ^) + '' ac ("7 + "^) 

> a 2 + b 1 + c 1 + lab + 2 be + 2oc 
= (a + b + cf. 

The inequality in the second step follows from the fact that for any two 
positive numbers x and y we have x/y + y/x > 2, with equality if and 
only if x = As a result of this fact, (ap + bq + cr\a/p + b/q 4- c/r) will 
attain its minimum value (a + b + c)~ when, and only when, p = q = r. 
Equivalently, a/p + b/q + c/r attains a minimum value when P is located 
at the incenter of the triangle. 


13 . 4 . Prove that if m and n are positive integers and 1 < k < n, then 


Y (n\( m \ (m + n 

l k 


) 


Solution. The statement of the problem constitutes one of the fundamental 
identities involving binomial coefficients. On the left side is a sum of 
products of binomial coefficients. Obviously, a direct substitution of facto¬ 
rials for binomial coefficients provides no insight. 

Quite often, finite series (especially those which involve binomial coeffi¬ 
cients) can be summed combinatorially. To understand what is meant here, 
transform the series problem into a counting problem in the following 
manner. Let S = A U B, where A is a set with n elements and B is a set, 
disjoint from A , with m elements. We will count, in two different ways, the 
number of (distinct) fc-subsets of S. On the one hand, this number is ('"*"). 
On the other hand, the number of ^-subsets of S with exactly / elements 
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from A (and k - i elements from B) is (-X*-;)- It follows that 

^ k n ) = ^°‘ *' su ^ sets °I $ 

k 

= ^ (No. of ^-subsets of S with i elements from A ) 

i = 0 

(Another solution to this problem, based on the properties of polynomials, 
is given in 4.3.2.) 

Counting problems can often be simplified by “identifying" (by means 
of a one-to-one correspondence) the elements of one set with those of 
another set whose elements can more easily be counted. The next three 
examples illustrate the idea. 

13 . 5 . On a circle « points are selected and the chords joining them in pairs 
are drawn. Assuming that no three of these chords are concurrent (except 
at the endpoints), how many points of intersection are there? 

Solution. The cases for » = 4,5,6 are shown in Figure 1.12. Notice that 
each (interior) intersection point determines, and is determined by, four of 
the given n points along the circle (these four points will uniquely produce 
two chords which intersect in the interior of the circle). Thus, the number 
of intersection points is Q. 

13 . 6 . Given a positive integer «, find the number of quadruples of integers 
{a,b,c,d) such that 0<a<b<c<d<n. 

Solution. The key idea which makes the problem transparent is to notice 
that there is a one-to-one correspondence between the quadruples of our set 





Figure 1.12. 
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and the subsets of four objects taken from {0, l,...,u + 3}. Specifically, 
let (a,b, c, d\ 0 < a < b < c < d < n, be identified with the subset {a,b + 
l,c + 2 ,d + 3). It is easy to see that this correspondence is one-to-one- 
each element of our set corresponds to exactly one subset of four from 
{0,1, .... n + 3}, and vice versa. Thus, the desired number is (" 4 +4 ). 

13.7. The number 5 can be expressed as a sum of 3 natural numbers, 
taking order into account, in 6 ways, namely, as5 = l + l+ 3 = l+ 3+ l 
= 3+ l + l = l+ 2 + 2 = 2+ l+ 2 = 2 + 2+l. Let m and n be natural 
numbers such that m < n. In how many ways can n be written as a sum of 
m natural numbers, taking order into account? 


Solution. Write n as a sum of n ones: 

M = 1 + 1 + ‘ • • +1. 

n 

The number we seek is the number of ways of choosing m — 1 plus signs 
from the n — 1; that is, 


Problems 

13 . 8 . Show that x 1 ~ 2x~ + 10x 2 - 1 has no root greater than 1. (Hint: 
Since it is generally easier to show that an equation has no positive root, we 
are prompted to consider the equivalent problem obtained by making the 
algebraic substitution x = y + 1.) 

13 . 9 . The number 3 can be expressed as a sum of one or more positive 
integers, taking order into account, in four ways, namely, as 3, 1 + 2, 2 + 1, 
and 1 + 1 + 1. Show that any positive integer n can be so expressed in 2” -1 
ways. 

13 . 10 . In how many ways can 10 be expressed as a sum of 5 nonnegative 
integers, when order is taken into account? (Hint: Find an equivalent 
problem in which the phrase “5 nonnegative integers” is replaced by “5 
positive integers”.) 

13 . 11 . For what values of a does the system of equations 

(x-a ) 2 + y 3 -l 

have exactly zero, one, two, three, four solutions, respectively? (Hint: 
Translate the problem into an equivalent geometry problem.) 
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13 . 12 . Given n objects arranged in a row. A subset of these objects is 
called unfriendly if no two of its elements are consecutive. Show that the 
number of unfriendly subsets each having k elements is (" _ * + 1 ). (Hint: 
Adopt an idea similar to that used in 1.3.6.) 

13 . 13 . Let a(n) be the number of representations of the positive integer n 
as a sum of l’s and 2’s taking order into account. Let b(n) be the number 
of representations of « as a sum of integers greater than 1, again taking 
order into account and counting the summand n. The table below shows 
that a(4) = 5 and 6(6) = 5: 


a-sums 6-sums 


1 + 1+2 
1 + 2+1 
2+1 + 1 
2+2 

1 + 1 + 1 + 1 


— 4 + 2 
3 + 3 
2+4 
2 + 2 + 2 
6 


(a) Show that a(n) = b(n + 2) for each «, by describing a one-to-one 
correspondence between the a-sums and 6-sums. 

(b) Show that a(l)=l, a(2) = 2, and for n > 2, a(n)=a(n- 1) + 
a(n — 2). 


13 . 14 . By finding the area of a triangle in two different ways, prove that if 
p u P 2 - P-i are the altitudes of a triangle and r is the radius of its inscribed 
circle, then \/p { + \/p 2 + l//> 3 = 1 / r. 

13 . 15 . Use a counting argument to prove that for integers r ,«, 0 < r < n, 

W + ( r : 1 ) + ( r : 2 ) + '" + (:)-(": 1 1 )- 


Additional Examples 

1.2.3, 5.1.5, 5.1.14, 7.4.6, 8.2.6. There are so many examples of this heuristic 
that it is difficult to single out those that are most typical. Noteworthy are 
the indirect proofs in Section 1.9, 1.10, 1.11, the congruence problems in 
Section 3.2, the limit problems in Section 6.8. Other examples of partial 
fractions (see 1.3.1) are 4.3,23, 5.3.1, 5.3.2, 5.3.3, 5.3.6, 5.3.12, 5.4.9, 5.4.13, 
5.4.20, 5.4.24, 5.4.25. Examples based on the identity jc = exp(logx) are 
5.3.7(c), 6.3.3, 6.7.1, 6.7.4, 6.7.5, 6.7.7, 6.9.5, 7.4.1, 7.4.2, 7.4.9, 7.4.20. 
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1 .4. Modify the Problem 

In the course of work on problem A we may be led to consider problem, 21. 
Characteristically, this change in problems is announced by such phrases as 
“it suffices to show that...” or “we may assume that...” or “without 
loss of generality . . . ”. In the last section we looked at examples in which 
A and B were equivalent problems, that is, the solution of either one of 
them implied the solution of the other. In this section we look at cases 
where the solution of the modified (or auxiliary) problem, problem B, 
implies the solution of A, but not necessarily vice versa. 

1.4.1. Given positive numbers a,b,c,d, prove that 

a 3 + b 3 + c 3 + b 3 + c 3 + d 3 + c 3 + d 3 + a 3 + d 3 + a 3 + b 3 
a + b + c b + c + d c + d + a d + a + b 

> a 2 + b 2 + c 2 + d 2 . 


Solution. Because of the symmetry in the problem, it is sufficient to prove 
that for all positive numbers x, y, and z 

x 3 +y 3 + z 3 ^ x 2 +y 2 + z 2 
x +y + z 3 

For if this were the case, the left side of the original inequality is at least 

a 2 + b 2 + c 1 . b 1 + c 2 + d 2 , c 2 + d 2 + a 2 . d 2 + a 2 + b 2 
3 + 3 3 3 

= a 2 + b 1 + c 2 + d 2 . 

Now, to prove this latter inequality, there is no loss of generality in 
supposing that x + y + z = 1. For if not, simply divide each side of the 
inequality by (x + y + z) 3 , and let A” = x /( x + y + z), Y = y/(x + y + z), 
and Z = z/(x + y + z). 

Thus, the original problem reduces to the following modified problem: 
Given positive numbers X, Y, Z such that X 4- Y + Z = 1, prove that 


x 3 + y 3 + z 3 


> 


x 2 + y 2 + z 2 

3 


(For a proof of this inequality, see 7.3.5.) 


1.4.2. Let C be any point on the line segment AB between A and B, and 
let semicircles be drawn on the same side of AB with AB, AC, and CB as 
diameters (Figure 1.13). Also let D be a point on the semicircle having 
diameter AB such that CD is perpendicular to AB, and let E and F be 
points on the semicircles having diameters AC and CB, respectively, such 



].4. Modify the Problem 


23 



that EF is a segment of their common tangent. Show that ECFD is a 
rectangle. 

Solution. Note that it is sufficient to show that A, E, and D are collinear (the 
same argument would show that B, F, and D are collinear). For if this were 
the case, Z AEC = 90° (E is on circle AEC), LADB - 90°, L CFB = 90°, 
and the result holds. It turns out, however, that without some insight, there 
are many ways of going wrong with this approach; it’s difficult to avoid 
assuming the conclusion. 

One way of gaining insight into the relationships among the parameters 
in a problem is to notice the effect when one of them is allowed to vary 
(problem modification). In this problem, let D vary along the circumfer¬ 
ence. Let G and H (Figure 1.14) denote the intersections of the segments 
AD and BD with the circles with diameters AC and CB (and centers O and 
O') respectively. Then Z AGC = LADB = L CHB = 90°, so that GDHC 
is a rectangle. Furthermore, Z OGC = Z OCG (A OGC is isosceles), and 
Z CGH = Z GCD because GH and CD are diagonals of a rectangle. 
Therefore, Z OGH — Z OCD. Now, as D moves to make CD perpendicu¬ 
lar to AB, Z OGH will also move to 90°, so that GH is tangent to circle O, 
and G coincides with E. A similar argument shows GH is tanget to circle 
O', so H = F. This completes the proof. (Note the phrase “a similar 



Figure 1,14. 
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argument,” another simplifying technique, has the same effect when placed 
after an argument as “it suffices to show that” has when placed before the 
argument.) 

Note that we have solved the problem by solving a more general 
problem. This is a common problem-solving technique; we will see more 
examples of it in Section 1.12. 


1 . 43 . Prove that there do not exist positive integers x, y,z such that 

x 2 + y 2 + z 2 = 2 xyz. 

Solution, Suppose x, y, and z are positive integers such that x 1 + y 2 + z 2 
— 2xyz. Since x 2 + y 2 + z 2 is even (= 2 xyz), either two of x, >\ and z are 
odd and the other even, or all three are even. Suppose x, y,z are even. 
Then there are positive integers x ] ,y v z l such that x = 2x ]J y = 2j 1 , z 
= 2 z v From the fact that (2x,) 2 4- (2j ( ) 2 + (2zj) 2 = 2(2x l X2_y 1 )(2z l ) it fol¬ 
lows that x ( , y l ,z l satisfy x 2 + y^ + z 2 = 2 2 x l y l z i . Again, from this equa¬ 
tion, if x } ,y ]t z ] are even, a similar argument shows there will be positive 
integers x 2 , _y 2 ,z 2 such that x 2 + y 2 + zj = 2 3 x 2 y 2 z 2 . 

Continue in this way. Eventually we must arrive at an equation of the 
form a 2 + b 2 + c 2 = 2"abc where not all of a,b,c are even (and hence two 
of a, b,c are even and one is odd). 

Thus, we are led to consider the following modified problem: Prove 
there do not exist positive integers x, y, z and n, with x, y odd, such that 

x 2 + y 2 + z 2 — 2 n xyz. 

(This is Problem 1.9.3.) 

1.4.4. Evaluate Ja > e~ xl dx. 

Solution. The usual integration techniques studied in first-year calculus will 
not work on this integral. To evaluate the integral we will transform the 
single integral into a double integral. 

Let 1 = j'^'e ~ x2 dx. Then 

-L‘°^ e ^’’ dxdy 
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Now change to an equivalent integral by switching to polar coordinates. 
We then have 



It follows that / = Vw" /2. 

A modified (auxiliary) problem can arise in many ways. It may come 
about with a change in notation (as in 1.4.4; see Section 1.5) or because of 
symmetry (as in 1.4.1; see Section 1.6). Often it is the result of “working 
backward” (see Section 1.8) or arguing by contradiction (as in 1.4.3; see 
Section 1.9). It is not uncommon to consider a more general problem at the 
outset (as in 1.4.2; see Section 1.12). Thus we see that problem modification 
is a very general heuristic. Because of this, we will defer adding more 
examples and problems, putting them more appropriately in the more 
specialized sections which follow. 


1.5. Choose Effective Notation 


One of the first steps in working a mathematics problem is to translate the 
problem into symbolic terms. At the outset, all key concepts should be 
identified and labeled; redundancies in notation can be eliminated as 
relationships are discovered. 

1.5.1. One morning it started snowing at a heavy and constant rate. A 
snowplow started out at 8:00 A.M. At 9:00 A.M. it had gone 2 miles. By 
10:00 A.M. it had gone 3 miles. Assuming that the snowplow removes a 
constant volume of snow per hour, determine the time at which it started 
snowing. 


Solution. It is difficult to imagine there is enough information in the 
problem to answer the question. However, if there is a way, we must 
proceed systematically by first identifying those quantities that are un¬ 
known. We introduce the following notation; Let t denote the time that has 
elapsed since it started snowing, and let T be the time at which the plow 
goes out (measured from / “ 0). Let x(f) be the distance the plow has gone 
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at time t (we are only interested in ,x(/) for / > T). Finally, let h(t) denote 
the depth of the snow at time t. 

We are now ready to translate the problem into symbolic terms. The fact 
that the snow is falling at a constant rate means that the depth is increasing 
at a constant rate; that is, 

JL 

= c, c constant. 
dt 

Integrating each side yields 

h(t) = ct + d, c,d constants. 

Since A(0) = 0, we get d = 0. Thus h(t) = ct. 

The fact that the plow removes snow at a constant rate means that the 
speed of the plow is inversely proportional to the depth at any time t (for 
example, twic^ the depth corresponds to half the speed). Symbolically, for 
t > T, 

dx _ k 
dt h(t) ’ 

= Jl = K 

ct t ’ 

Integrating each side yields 

x(t) = K\ogt + C, C constant. 

We are given three conditions: x = 0 when t = T, x = 2 when t = T + 1, 
and x = 3 when t = T + 2. With two of these conditions we can evaluate 
the constants K and C, and with the third, we can solve for T. It turns out 
(the details are not of interest here) that 

T = ^^ * ** 0.618 hours sa 37 minutes, 5 seconds. 

Thus, it started snowing at 7:22:55 A.M. 

1 . 5 . 2 . 

(a) If n is a positive integer such that 2n + 1 is a perfect square, show that 
n + 1 is the sum of two successive perfect squares. 

(b) If 3n + 1 is a perfect square, show that «+ 1 is the sum of three perfect 
squares. 


k constant 

b 

K - — constant. 
c 


Solution. By introducing proper notation, this reduces to a simple algebra 
problem. For part (a), suppose that 2n + 1 = s 2 , s an integer. Since s 2 is an 
odd number, so also is s. Let r be an integer such that s = It + 1. Then 
2/i + 1 = (2/ + l) 2 , and solving for n we find 


n 


(2/ + l) 2 - 1 
2 


4r 2 + At 
2 


= 2/ 2 + It. 
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A 



Consequently, 

n + 1 = It 2 + 2t + 1 = t 1 + (t + I) 2 . 

(b) Suppose 3* + 1 = s 2 , s an integer. Evidently, s is not a multiple of 3, 
so s = 3f ± 1 for some integer t. Then 3n + 1 = (3t ± l) 2 , and therefore 


n 


(3t ± 1) J — 1 _ 9 ,2 ±6 , 
3 3 


3f 2 ± It. 


Hence, 


n + i = 3 t 2 ± 2t + 1 = 2r 2 + (t ± 1) 2 = t 2 + t 2 + (t ± 1)1 


1.53, In triangle ABC, AB = AC, D is the midpoint of BC, E is the foot 
of the perpendicular drawn D to AC, and F is the midpoint of DE (Figure 
1.15). Prove that AF perpendicular to BE. 


Solution. We can transform the problem into algebraic terms by coordina- 
tizing the relevant points and by showing that the slopes m BE and m AF are 
negative reciprocals. 

One way to proceed is to take the triangle as it appears in Figure. 1.15: 
take D as the origin (0,0), A = (0, a), B = (- b, 0), and C = (b, 0). This is a 
natural labeling of the figure because it takes advantage of the bilateral 
symmetry of the isosceles triangle (see the examples in Section 1.6). How¬ 
ever, in this particular instance, this notation leads to some minor complica¬ 
tions when we look for the coordinates of E and F. 

A better coordinatization is to take A = (0,0), B — (4a, 4b), C = (4c, 0), 
as in Figure 1.16. Then a 2 + b 2 = c 1 , D = (2a + 2c, 2b), E = (2a + 2c, 0), 
and F = (2a + 2c, b). (Almost no computation here; all relevant points are 
coordinatized.) It follows that 

m AF m 3E = ( 2 (a + c) )( 4a - (2a + 2c) ) = a ^~ c 2 " ” 
and the proof is complete. 
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1.5.4. Let — 1 < a 0 < 1 and define recursively 


_ t + g n -i \‘ /Z 

"l 2 ) ’ 


n > 0. 


Let A„ = 4”(1 — a„). What happens to A n as n tends to infinity? 


Solution. Direct attempts to express a n in terms of a 0 lead to hopelessly 
complicated expressions containing nested sequences of radicals, and there 
is no way to condense them into a closed form. 

The key insight needed is to observe that there is a unique angle 0, 
0 < 8 < 7r, such that a 0 = cos 9. For this 9, 


a 


-( 


1 + cosfl 
2 



Similarly, 


«i 


I 1 + cos(0/2) ^ 

\ 2 j 




We can now compute 

“4"(1 - cos(0/2")) 

_ 4 n (l - cos(0/2"))(l + cos(0/2")) 
1 + cos(0/2") 


4" sin 2 (#/2") 

1 + cos(0/2") 

/ 0 2 W sin(tf/2") \ 2 

\ 1 +cos(0/2 fl ) ^ 9/2 n 

As n becomes large, 9 2 /{\ + cos(0/2")) tends to 9 1 / 2, and (sin(0/2"))/ 
{9/ 2") approaches I (recall that (sinjr)/x-» 1 as x >0), and therefore, A n 
converges to 9 2 /2 as n tends to infinity. 
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Problems 

1.5.5. Write an equation to represent the following statements: 

(a) At Mindy’s restaurant, for every four people who ordered cheesecake, 
there were five who ordered strudel. 

(b) There are six times as many students as professors at this college. 

1.5.6. Guy wires are strung from the top of each of two poles to the base 
of the other. What is the height from the ground where the two wires cross? 

1.5.7. A piece of paper 8 inches wide is folded as in Figure 1.17 so that one 
corner is placed on the opposite side. Express the length of the crease, L, in 
terms of the angle 6 alone. 

1.5.8. Let P { , P 2 , .. ., P l2 be the successive vertices of a regular dodeca¬ 
gon (twelve sides). Are the diagonals P } P 9 ,P 2 P U ,P 4 P l 2 concurrent? 

1.5.9. Use algebra to support your answers to each of the following. 

(a) A car travels from A to B at the rate of 40 miles per hour and then 
returns from B to A at the rate of 60 miles per hour. Is the average rate 
for the round trip more or less than 50 miles per hour? 

(b) You are given a cup of coffee and a cup of cream, each containing the 
same amount of liquid. A spoonful of cream is taken from the cup and 
put into the coffee cup, then a spoonful of the mixture is put back into 
the cream cup. Is there now more or less cream in the coffee cup than 
coffee in the cream cup? (This problem has an elegant nonalgebraic 
solution based on the observation that the coffee in the cream cup has 
displaced an equal amount of cream which must be in the coffee cup.) 

(c) Imagine that the earth is a smooth sphere and that a string is wrapped 
around it at the equator. Now suppose that the string is lengthened by 
six feet and the new length is evenly pushed out to form a larger circle 
just over the equator. Is the distance between the string and the surface 
of the earth more or less than one inch? 



Figure 1.17. 
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1 . 5 . 10 . A well-known theorem asserts that a prime p > 2 can be written as 
a sum of two perfect squares (p = m 2 + n 2 , with m and n integers) if and 
only if p is one more than a multiple of 4. Assuming this result, show that: 

(a) Every prime one more than a multiple of 8 can be written in the form 
x 2 + 16y 2 , x andy integers. 

(b) Every prime five more than a multiple of 8 can be written in the form 
(2x + y) 2 + 4 y 2 , x andy integers. 

Additional Examples 

1.1.10, 2.5.10, 3.2.15, 3.3.11, 3.3.28, 3.4.2, 3.4.4, 4.1.5, 6.4.2, 7.2.4, 8.1.15, 
8.2.3, 8.2.17. Also, see Sections 2.5 (Recurrence Relations), 3.2 (Modular 
Arithmetic), 3.4 (Positional Notation), 8.3 (Vector Geometry), 8.4 (Complex 
Numbers in Geometry). 


1.6. Exploit Symmetry 


The presence of symmetry in a problem usually provides a means for 
reducing the amount of work in arriving at a solution. For example, 
consider the product (a + b + c)(a 2 + b 1 + c 2 — ab — ac - be). Since each 
factor is symmetrical in a, b , c (the expression remains unchanged when¬ 
ever any pair of its variables are interchanged), the same will be true of the 
product. As a result, if a 3 appears in the product, so will b 3 and c 3 . 
Similarly, if a 2 b appears in the product, so will a 2 c, b 2 a, b 2 c, c 2 a, c 2 b, and 
each will occur with the same coefficient, etc. Thus, a quick check shows 
the product will have the form 

A (a 3 + b 3 + c 3 ) + B{a 2 b + a 2 c + b 2 a + b 2 c + c 2 a + c 2 b) + C{abc). 

It is an easy matter to check that A = 1, B = 0, and C = —3. 

1.6.1. Equilateral triangles A BK, BCL, CDM, DAN are constructed inside 
the square A BCD. Prove that the midpoints of the four segments KL, LM, 
MN, NK and the midpoints of the eight segments AK, BK, BL, CL, CM, 
DM, DN , AN are the twelve vertices of a regular dodecagon. 


Solution. The twelve vertices are indicated in Figure 1.18 by heavy dots; 
two of these vertices are labeled a and b as shown. 

Using the symmetry of the figure, it suffices to show that LbOK =15°, 
LaOb = 30°, and \aO\ = \bO\. 

Note that AN is part of the perpendicular bisector of BK, and therefore 
|XV|=*|Vi?|. Using symmetry it follows that MBN is an equilateral 
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triangle, say of side length s, and that Z CBN = 15°. Now consider triangle 
DBN\ note that Ob joins the midpoints of DB and DN, so Ob is parallel to 
BN and half its length. Thus \Ob\ = s/2 and LbOK = 15°. From this it is 
easy to check that Z aOb — L DOK — Z bOK = 45° - 15° = 30°, and | Oa\ 
= \KN\/2 = s/2. 

The presence of symmetry in a problem also provides a clarity of vision 
which often enables us to see and discover relationships that might be more 
difficult to find by other means. For example, symmetry considerations 
alone suggest that the maximum value of xy, subject to x + y = 1, x > 0, 
y > 0, should occur when x = y ~ \ (x and y are symmetrically related). 
This is an example of the principle of insufficient reason, which can be stated 
briefly as follows: “Where there is no sufficient reason to distinguish, there 
can be no distinction.” Thus, there is no reason to expect the maximum will 
occur when a is anything other than that is, closer to 0 or to 1. To verify 
this, let x = | + e. Then y — % — e, and, xy = +eKi ~ e )~4 In 
this form it is clear that the maximum occurs when e = 0; that is, x = y 
-*■ 

The next problem offers several additional examples of this principle. 

1 . 6 . 2 . 

(a) Of all rectangles which can be inscribed in a given circle, which has the 
greatest area? 
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(b) Maximize sin/l + siniH- sinC, where A, B, C are the measures of the 
three angles of a triangle. 

(c) Of all triangles of fixed perimeter, which has the greatest area? 

(d) Of all parallelepipeds of volume 1, which has the smallest surface area? 

(e) Of all n-gons that can be inscribed in a given circle, which has the 
greatest area? 

Solution, (a) The principle of insufficient reason leads us to suspect the 
rectangle of maximum area that can be inscribed in a circle is a square 
(Figure 1.19). To verify this, let x and y denote the length and width of the 
rectangle, and suppose without loss of generality that the units are chosen 
so that the diameter of the circle is unity. We wish to maximize xy subject 
to x 1 + y 2 = 1. It is equivalent to maximize x 2 }’ 2 subject to x 1 4- y =1. But 
this is the same problem as that considered prior to this example; the 
maximum value occurs when x 2 = y 2 = 5, that is, when the rectangle is a 
square. 

(b) Notice that the sum, sin A + sin B + sin C, is always positive (since 
each of the terms is positive), and it can be made arbitrarily small (in 
magnitude) by making A arbitrarily close to 180°. There is no reason to 
expect the maximum will occur at any point other than A = B = C = 60° 
(an equilateral triangle). A proof of this follows from the discussion in 2.4.1. 

In a similar manner, we suspect the answers to (c), (d), and (e) are an 
equilateral triangle, a cube, and a regular n-gon. Proofs for these conjec¬ 
tures are given in 7.2.1, 7.2.12, and 2.4.1. 

1 . 63 . Evaluate 


pr/2 dx 
•*> 1 + (tanx) 1 ^ 


Solution. Here is a problem that cannot be evaluated by the usual tech¬ 
niques of integration; that is to say, the integrand does not have an 
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antiderivative. However, the problem can be handled if we happen to 
notice that the integrand (Figure 1.20) is symmetric about the point (^ m, j). 

To show this is so (it is not obvious), let/(jc) = 1/(1 + (tanx)^). It suffices 
to show that /(x) + /(w/2 — x) = 1 for all x, 0 < x < vr/2. Thus, we 
compute, for r = /2 , 


/(*/ 2 “ *) + /(*) = 


_ 1 _ 

1 + tan r (^w - x) 


+ 


1 

1 + tan r x 


- -- +- 1 - 

1 + cot r x 1 + tan'x 

tan r x , 1 

1 + tan f x 1 + tan f x 


= 1 . 

It follows from the symmetry just demonstrated that the area under the 
curve on [0,( tt] is one-half the area in the rectangle (see Figure 1.20); that 
is, the integral is (V/2)/2 = w/4. 


Another way to take advantage of symmetry is in the choice of notation. 
Here are a couple of illustrations. 


1 . 6 . 4 . Let P be a point on the graph of v = f(x), where f is a third-degree 
polynomial; let the tangent at P intersect the curve again at Q; and let A be 
the area of the region bounded by the curve and the segment PQ. Let B be 
the area of the region defined in the same way by starting with Q instead of 
P. What is the relationship between A and B7 


Solution, We know that a cubic polynomial is symmetric about its inflec¬ 
tion point (see 8.2.17). Since the areas of interest are unaffected by the 
choice of coordinate system, we will take the point of inflection as the 
origin. Therefore, we may assume the equation of the cubic is 

f(x) = <rx 3 + bx, a 7*0 

(see Figure 1.21). 

Suppose x 0 is the abscissa erf P. It turns out that the abscissa of Q is 
—2x 0 . (We will not be concerned with the details of this straightforward 
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computation. There is, indeed, a very elegant way to arrive at this fact, but 
it uses ideas found in Section 4.3 (see 4.3.7).) 

A straightforward integration shows that the area 4 is equal to Kx$, 
where X" is independent of * 0 . (Again, the details of this computation are 
not of concern here.) 

We now can apply our previous conclusions to the point Q. The tangent 
at Q will intersect the curve at R, the abscissa of which evidently is 
~2(-2x 0 ) = 4x 0 , and the area B is equal to K{ - 2x 0 ) 4 = 16 Kxq = 164. 


1.6.5. Determine all values of x which satisfy 

tan* = tan(* + 10°)tan(* + 20°)tan(* + 30°). 


Solution. We will introduce symmetry by a simple change of variable. Thus, 
sety = x + 15°. The equation then is 

tan(y - 15°) = tan(y - 5°)tan(y + 5°)tan(y + 15°), 

which is equivalent to 

sin(y — 15°)cos( v + 15°) sin(y — 5°)sin(y + 5°) 
cos(y - 15°)sin(y + 15°) cos(y - 5°)cos(y + 5°) 

Using the identities 

sin4 cosi? = \ [sin(4 - B) + sin(4 + 5)], 
sin4 sini? = \ [cos(4 — B) — cos(4 + H)], 
cos4 cos B = £ [cos(4 - B ) + cos(4 + il)], 

sin{ -30°) + sin2y cos( —10°) — cos 2y 
sin(30°) + sin2y cos( —10°) + cos2y ’ 


we get 
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or equivalently, 

2sin2y - 1 _ cos 10° - cos2y 
2 sin 2 y + 1 cos 10° + cos 2y 

This simplifies to 

sin 4y = cos 10°, 

which implies that 

4y = 80° + 360° k, 100° + 360° k, k = 0, ± 1, ±2,_ 

x = 5°+90°fc, 10° + 90° A:, fc-0, ± 1, ±2, . .. - 

Problems 

1 . 6 . 6 . 

(a) Exploit symmetry to expand the product 

(x^y + y 2 z + z 2 x)(xy 2 + yz 2 + zx 2 ). 

(b) If x + y + z = 0, prove that 

(Substitute z = —x — y and apply the binomial theorem. For another 
approach, see 4.3.9.) 

1 . 6 . 7 . The faces of each of the fifteen pennies, packed as exhibited in 
Figure 1.22, are colored either black or white. Prove that there exist three 
pennies of the same color whose centers are the vertices of an equilateral 
triangle. (There are many ways to exploit symmetry and create “without 
loss of generality” arguments.) 

1 . 6 . 8 . Make use of the principle of insufficient reason to minimize x 2 + 
x 2 + ■ ■ ■ + x 2 , subject to the condition that 0 < x, < 1, and x, + x 2 + 

• ■ • + ,x„ = 1. Prove your conjecture. (For the proof, take x, = l/n + e,.) 

1 . 6 . 9 . A point P is located in the interior of an equilateral triangle ABC. 
Perpendiculars drawn from P meet each of the sides in points D, E, and F, 
respectively. Where should P be located to make PD + PE + PF a maxi¬ 
mum? Where should P be located to make PD + PE + PF a minimum? 
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Justify your answers. (Hint: It is helpful to reflect the figure about one of 
the sides. What happens to PD + PE + PF as P moves parallel to the line 
of reflection?) 

1 . 6 . 10 . In Figure 1.23, ABCD is a square, L ECD = L EDC = 15°. Show 
that triangle AEB is equilateral. (The key to this very beautiful problem is 
to create central symmetry. Specifically, add identical 15° angles on sides 
AB, BC, and AD (as on side CD ) and create a diagram much like that 
constructed in 1.6.1.) 

1 . 6 . 11 . The product of four consecutive terms of an arithmetic progression 
of integers plus the fourth power of the common difference is always a 
perfect square. Verify this identity by incorporating symmetry into the 
notation. 


Additional Examples 
1.4.1, 8.1.4, 8.1.5, 8.1.8, 8.2.3. 

1.7. Divide into Cases 


It often happens that a problem can be divided into a small number of 
subproblems, each of which can be handled separately in a case-by-case 
manner. This is especially true when the problem contains a universal 
quantifier (“for all x . . . ”). For example, the proof of a proposition of the 
form “for all integers ...” might be carried out by arguing the even and 
odd cases separately. Similarly, a theorem about triangles might be proved 
by dividing it into three cases depending upon whether the triangle is acute. 
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right, or obtuse. Occasionally, the subproblems can be arranged hierarchi¬ 
cally into subgoals, so that the first cases, once established, can be used to 
verify the succeeding stages. Such a procedure is called hillclimbing. 

In the early stages of analysis, it is good to think about how a problem 
might be subdivided into a small number of (hopefully) simpler subprob¬ 
lems. The heuristic of this section is often given in the following form: “If 
you can’t solve the problem, find a simpler related problem and solve it.” 

1.7.1. Prove that an angle inscribed in a circle is equal to one-half the 
central angle which subtends the same arc. 

Solution. We are given a circle, say with center O, and an inscribed angle 
APB; some examples are shown in Figure 1.24. We are to prove that in all 
instances LAPB = ^ LAOB. The three preceding figures represent three 
essentially different situations. Specifically, the center of the circle, O, is 
either inside LAPB (diagram 2), or outside LAPB (diagram 3), or on one 
of the rays of LAPB (diagram 1). We shall prove the theorem by 
considering each of these cases separately. 

Case 1. Suppose the center O is on PA. Then LAOB = LOPB + 
L OBP (exterior angle equals sum of opposite interior angles) =2 L OPB 
(A OPB is isosceles) =2 LAPB. The result follows. 

Case 2. If O is interior to LAPB (diagram 2), extend line PO to cut the 
circle at D. We have just proved that 2 LAPD = LAOD and 2 L DPB 
= L DOB. Adding these equations gives the desired result. 

Case 3. If O is exterior to LAPB (diagram 3), extend PO to cut the 
circle D. Then, using case 1, 2LDPB = L DOB and 2Z DP A = L DOA. 
Subtracting the second equation from the first yields the result. This 
completes the proof. 

1.7.2, A real-valued function/, defined on the rational numbers, satisfies 
for all rational x and y. Prove that f(x) “/(!) • x for all rational x. 
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Solution. We will proceed in a number of steps. We will prove the result 
first for the positive integers, then for the nonpositive integers, then for the 
reciprocals of integers, and finally for all rational numbers. 

Case I (positive integers). The result holds when x = 1. For x =2, we 
have /(2)=/(l + 1)-/(1)+/(1) = 2/(1). For *-3, /(3)-/(2 + 1) 
= /(2)+/(!) = 2/(1)+/(!) = 3/(1). It is clear that this process can be 
continued, and that for any positive integer m,/(m) = «/(l). (A formal proof 
can be given based on the principle of mathematical induction—see Chap¬ 
ter 2). 

Case 2 (nonpositive integers). First,/(0) =/(0 + 0) =/(0)+/(0). Sub¬ 
tract /(0) from each side to get 0=/(0); that is, /(0) = 0-/(l). Now, 
0 =/(0) = /(l + (- 1)) =/(l) +/(- 1). From this, we see that /( - 1) = 
— /(l). Similarly, for any positive integer n, f(n) + /(- n) = f(n + (- «)) 
= /(0) = 0, so that/( — n)= - nf( I). 

Case 3 (reciprocals). For x = t, we proceed as follows: /(l) =/() + 4) 
= /(l) +/(i) = 2/(1). Divide by 2 to get /($)-/(l)/2. For x = t, /(l) 
-/(* + i + 3 )=/( 3 >+/(i)+/(j) = 3 /(U or equivalently, /<$) = 
/(l)/3. In a similar way, for any positive integer n, /(I jn) = f(])/n. For 
x = — l/n, we have /(l/«) +/( - 1 /«) =/(l/n + (- 1/n)) = /(0) = 0, so 
/(-!/«)= -/(!)/«• 

Case 4 (all rationals). Let n be an integer. Then f(2/n) = /(I fit + 1 / n) 
= /(l/n) + /(!/«) = 2/(1 jn) = (2/«)/(!). Similarly, if m/n is any rational 
number, with m a positive integer and n an integer, then 



m times m times 


-"*(*)-7 *>> 

This establishes the result—a good example of hillclimbing. 


1.73. Prove that the area of a lattice triangle is equal to / + \ B — 1, where 
I and B denote respectively the number of interior and boundary lattice 
points of the triangle. (A lattice triangle is a triangle in the plane with 
lattice points as vertices.) 


Solution. This is a special case of Pick’s theorem (see 2.3.1). There are a 
number of ingenious proofs, each of which divide the set of lattice triangles 
into a few special types. One way to do this is to “circumscribe” about the 
triangle a rectangle with edges parallel to the coordinate axes. At least one 
vertex of the rectangle must coincide with a vertex of the triangle. Now it 
can be checked that every lattice triangle can be classified into one of the 
nonequivalent classes sketched in Figure 1.25. 
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D C C C 



A A 

4 5 

Figure 1.25. 


In the first class are those right triangles whose legs are parallel to the 
coordinate axes. The second class includes acute-angled triangles one of 
whose sides is parallel to a coordinate axis. Such triangles are the “sum” of 
two triangles from the first class. In the third class are the obtuse triangles 
which have one side parallel to a coordinate axis. They are the “difference” 
of two triangles from the first class. The fourth and fifth classes cover those 
triangles having no sides parallel to the coordinate axes. 

The proof of the result follows a hillclimbing pattern. To get started, let 
us consider the rectangle A BCD in case 1. Suppose that line segments AB 
and AD contain a and b lattice points, respectively, not counting their 
endpoints. Then, with I and B the interior and boundary points of A BCD, 

I + ^ B — 1 = ob + ^ (2q + lb + 4) — 1 
= ab + a + b + 1 
-(«+ 1 )( 6+ 1 ) 

= Area A BCD. 

Now suppose that AB, BC, and AC contain a, b c lattice points, 
respectively, not counting their endpoints, and suppose that ABC contains i 
interior points. Then rectangle ABCD has 2/ + c interior points, and we 
have, with I and B the interior and boundary points of A BC, 

/ + £ B — l=/ + y(u + 6 + c + 3) — 1 
= j(2 i + a + b + c + 1) 

= H( 2l ' + c > + 3(2a + 26 + 4)- 1] 

= j AieaABCD 
= Area. ABC. 

The other cases can be handled in a similar way; we leave the details to 
the reader. 



40 


l. Heuristics 


Problems 

1.7.4 (Triangle inequality). 

(a) Prove that for all real numbers x and y, |x + y| < |x| + \y\. 

(b) Prove that for all real numbers x, y, and z, |j: — y\ < jx — z\ + \y — z\. 

1.7.5. Find all values of x which satisfy 


1.7.6. Let S = {((3,8) + y(4, - 1) + A(5,4)| i,j,k are integers}, and T 
= {m(l,5) + n(0,7)| m,n are integers}. Prove that S = T. (Note: Ordered 
pairs of integers are added componentwise: (j,/) + (s',/') = (s + s',/ + /'), 
and n(s,t) = (ns,nt).) 

1.7.7. A real-valued function /, defined on the positive rational numbers, 
satisfies f(x +y)= /(*)/(>’) for all positive rational numbers x and y. 
Prove that f(x) = [/(l)] 1 for all positive rational x. 

1.7Determine F(x) if, for all real x and y, F(x)F{y) — F(xy) — x + y. 


Additional Examples 

1.1.7, 2.5.11c, 2.5.12, 2.5.13, 2.63, 3,2.14, 3.2.15, 3.2.16, 3.2.17, 3.2.18, 3.4.1, 

4.1.3, 4.1.4, 4.4.14, 4.4.29, 5.2.1, 5.3.14c, 6.5.4, 7.4.3, 7.6.2, 7.6.4, 7.6.10, 

8.2.4. Some particularly nice examples which reduce to the study of very 
special cases are 3.3.8, 3.3.9, 3.3.21, 3.3.22, 3.3.26. 


1.8. Work Backward 


To work backward means to assume the conclusion and then to draw 
deductions from the conclusion until we arrive at something known or 
something which can be easily proved. After we arrive at the given or the 
known, we then reverse the steps in the argument and proceed forward to 
the conclusion. 

This procedure is common in high-school algebra and trigonometry. For 
example, to find all real numbers which satisfy 2x + 3 = 7, we argue as 
follows. Suppose that x satisfies 2x + 3 = 7. Then, subtract 3 from each 
side of the equation and divide each side by 2, to get x = 2. Since each step 
in this derivation can be reversed, we conclude that 2 does indeed satisfy 
2x + 3 = 7 and is the only such number. 
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Often, in routine manipulations, such as in the previous example, an 
explicit rewriting of the steps is not done. However, it is important to be 
aware of what can, and what cannot, be reversed. For example, consider 
the equation fx + 1 — Jx — 1 = 2. (Here, as usual, the square root is 
i nterpre ted as the positive square root.) Write the equation in the form 
A + 1 = t/x - 1 +2, and square each side to get x + 1 = x — 1 + 
4i lx — 1 + 4, or equivalently, {x—~T = — [. Square a second time to get 
x — 1 = \ , or Jt = |. We conclude that if there is a number x such that 
fx + 1 — i lx — 1 = 2, it has to equal §. However, f does not satisfy the 
original equation. The reason for this is that the steps are not all reversible. 
Thus, in this example, we proceed from Jx - 1 = - A to - I = ]. When 
this is reversed, however, the argument goes from x — 1 = \ to *jx — 1 = \. 


1.8,1. Let a be a fixed real number, 0 < a < ir, and let 


F(0) = 


sin# + sin(# + a) 
cos# - cos(# + a) ’ 


0 < 9 < m — a. 


Show that F is a constant. (This problem arose in 1.2.1.) 


Solution. Suppose that f is a constant. Then F(8) = F{ 0) for all 0, 
0 < $ < m — a. That is, 


sin# + sin(g + a) sina 
cos# - cos(# + a) 1 - cosa ’ 

[sin# + sin(# + «)] [ 1 — cosa] = sin a[ cos# — cos(# + a)], 
sin 9 + sin(# + a) — sin 0 cos a — sin(# + a)cosa 
= sin a cos 8 — sin a cos (8 + a), 
sin# + sin(# + a) — [sin# cos a + sin a cos#] 

- [sin(# + a)cosa - sinacosf# + a)] = 0, 
sin 0 + sin(# + a) — sin(# + a) — sin(# + a — a) = 0. 


( 1 ) 

( 2 ) 

( 3 ) 

( 4 ) 
(5) 


The last equation is an identity. For the proof, we must reverse these steps. 
The only questionable step is from (2) to (1): the proof is valid only if we 
do not divide by zero in going from (2) to (1). But (1 - cosa) 0 since 
0 < a < ?r, and cos# - cos (8 + a) > 0 since 0 <9<9 + a<ir. The proof 
therefore can be carried out; that is, starting with the known identity at (5), 
we can argue (via steps (4), (3), (2), (1)) that for all #, 0 < 8 < it - a, 
F(9) — sina/(l — cosa) = constant. 
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1.8.2. If a,b,c denote the lengths of the sides of a triangle, show that 
3 (ab + be + ca ) < (o + b + cf < 4 (ab + be + ca ). 


Solution. Consider the leftmost inequality: 

3 (ab + be + ca) < (a + b + c) 1 , 

3 (ab + be + ca) < a 2 + b 2 + c 2 + 2(ab + be + ca), 

ab + be + ca < a 2 + b 1 + c 2 , 

a 2 + b 1 + c 2 — ab — be — ca > 0, 
la 2 + 2b 1 + 2c 2 — 2ab — 2be — 2ca > 0, 

(a 2 - 2ab + b 2 ) + ( b 2 - 2 be + c 2 ) + (c 2 - 2ca + a 2 ) > 0, 

(a - b) 2 + (b — cf + (c - a) 2 > 0. 

This last inequality is true for all values of a,b,c. Now consider the right 
inequality: 

{a + b + cf < 4 (ab + be + ca), 

a 2 + b 2 + c 2 + 2(ab + be + ca) < 4 (ab + be + ca), 

a 2 + b 2 + c 2 < 2 {ab + be + ca), 

a 2 + b 2 + c 2 < a{b + e) 4- b(a + c) + c(b + a). 

This final inequality is true, since the sum of any two sides of a triangle is 
larger than the remaining side. Thus, a 2 < a{b + c), b 2 < b(a + c), and 
c 2 < c(b + a). 

The steps in each of these arguments can be reversed, so the proof is 
complete. 


1.83. Given: A OB is a diameter of the circle O; BM is tangent to the 
circle at B\ CF is tangent to the circle at E and meets BM at C; the chord 
AE, when extended, meets BM at D. Prove that BC = CD. (See Figure 
1.26.) 

Solution. Suppose BC = CD. Then CE = CD, since BC = CE (tangents 
from C to the circle at E and B are equal). Thus, Z CED = L CDE (base 
angles of an isosceles triangle are equal). We are led to consider the angles 
as labeled in Figure 1.26. 

Now, Ld is complementary to La since A ABD is a right triangle, and 
Le is complementary to Lc since LBEA is a right angle {AOB is a 
diameter). Therefore, La = Lc. But we know that La = Lc, since they 
both cut off the equal arc BE on the circle O. 
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M 

Figure 1.26. 

The proof can now be completed by reversing these steps. Thus (omit¬ 
ting reasons), Z a = Z c, and therefore, Z e = Z d. Hence CD = CE, CE 
= BC, and therefore BC = CD. 

1.8.4, In a round-robin tournament with n players P i ,P 2 , ■ ■ ■, P„, where 
n > 1, each player plays one game with each of the other players and rules 
are such that no ties can occur. Let W r and L r be the number of games won 
and lost, respectively, by player P r . Show that 

£ >n 2 = £z;. 

r-i r=l 

Solution. Suppose 2"-i W} = Cj. Then, 

£(>n 2 -4 2 )=o, 

r~ 1 - 

t(K~L r )(W r +L r ) = 0. 

r = ] 

But W r + L r = n — 1 for each r, so 

( «-i)S(^-4) = o, 

r = 1 

i(^ r -4)=o, 

r= I 

n n 

2 K= 2 L r- 
r=\ r- I 

This last equation is true, since the total number of games won by the n 
players has to equal the total number of games lost. The proof follows on 
reversing the preceding argument. 
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Problems 


1.8.5. 


(a) 


(b) 


Given positive real numbers x and _v, prove that 

2 - ' ^ " 


\/x + \/y 




t f J 

Given positive real numbers a and b such that a + b = 1, prove 

, ^ - < ax + by, x > 0, y > 0. 

a/x + b/y 


that 


1 . 8 . 6 . 


(a) If a,b,c are positive real numbers, and a < b + c, show that 

—-— < —£— + -C- . 

1 + a 1 + b 1 + c 

(b) If a, b, c are lengths of three segments which can form a triangle, show 
that the same is true for 1 /(a + c), \/(b + c), 1 /(a + b ). 

1.8.7. Two circles are tangent externally at A, and a common external 
tangent touches them at B and C. The line segment BA is extended, 
meeting the second circle at D. Prove that CD is a diameter. 


1.8.8. Consider the following argument. Suppose 8 satisfies 


Then, since 


cot# + tan 38 = 0. 


it follows that 


tan(a + jS) = 


tan a + tan/3 
1 - tan a tan ’ 


cot 8 + tan JL + . tan 2# _ 
1 — tan 8 tan 28 

cot#(l - tan#tan2#) + tan# + tan2# = 

cot 8 — tan 2# + tan 8 + tan 2# = 
cot# + tan# = 
1 + tan^# = 


0 , 

0 , 

0, 

0, 

0 , 


tan^# = — 1. 


Since this last equation cannot hold, the original equation does not have a 
solution (we don’t need to reverse any steps because the final step doesn’t 
yield any contenders). However, 8 = \ir does satisfy cot# + tan3# = 0. 
What’s wrong with the argument? 


1.8.9. With Euclidean tools (straightedge and compass), inscribe a square 
in a given triangle so that one side of the square lies on a given side of the 
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triangle. (Hint: Begin with the square and construct a triangle around it 
similar to the given triangle. Then use the fact that similar figures have 
proportional parts.) 


Additional Examples 

2.1.5, 7.1.1, 7.4.6. Also, see Section 2.2 (Induction) and Section 2.5 (Recur¬ 
sion). 


1,9. Argue by Contradiction 


To argue by contradiction means to assume the conclusion is not true and 
then to draw deductions until we arrive at something that is contradictory 
either to what is given (the indirect method) or to what is known to be true 
(reduclio ad absurdum). Thus, for example, to prove y5" is irrational, we 
might assume it is rational and proceed to derive a contradiction. The 
method is often appropriate when the conclusion is easily negated, when 
the hypotheses offer very little substance for manipulation, or when there is 
a dearth of ideas about how to proceed. 

As a simple example of this method of proof, consider the following 
argument which shows that the harmonic series diverges. Suppose on the 
contrary, that it converges—say to r. Then 

r =1 + i + i + i + i + ^ + i + i+ ... 

= I + i ■+ i + i +■•■ 

= r, 

a contradiction. We are forced to conclude that the series diverges. 


1 . 9 . 1 . Given that a,b,c are odd integers, prove that equation ax 1 + bx + c 
- 0 cannot have a rational root. 


Solution. Suppose pf q is a rational root, where (without loss of generality) 
p and q are not both even integers. We will first establish that neither p nor 
q is even. For suppose that p is even. From a(p/qf + b(p/q) + c = 0 we 
find that ap 2 + bpq + cq 2 = 0. Since ap 2 + bpq is even, cq 2 must be even, 
but this is impossible, since c and q are both odd. We get a similar 
contradiction if we suppose q is even. Therefore, both p and q are odd and 
ap 2 + bpq + cq 2 — 0. But this last equation states that the sum of three odd 
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numbers is zero, an impossibility. Therefore, the equation has no rational 
root. 

It is instructive to consider another proof of this result. The roots of 
ax 2 + bx + c = 0 are rational if and only if b 2 — 4 ac is a perfect square. So, 
suppose that b 2 — 4ac = (2n + l) 2 for some integer n (by supposition, 
b 2 — 4ac is odd, and therefore, if it is a square, it must be the square of an 
odd integer). Collecting multiples of 4 we have 

b 2 — 1 — 4 [n(/t + 1) + ac]. 

Since either n or n + 1 is even, n(n + 1) + ac is odd. Thus, the right side of 
the last equation is divisible by 4 but not by 8. However, the left side is 
divisible by 8, since b 2 — 1 = (b — lXh + 1) and one of b — 1 and b + 1 is 
divisible by 4, while the other is divisible by 2. Therefore the displayed 
equation above cannot hold, and we have a contradiction. (In this proof, 
we have reached a contradiction by looking at how two numbers stand 
relative to multiples of 8, rather than multiples of 2 as in the first proof. We 
will return to a deeper consideration of this idea in Section 3.2.) 

The next two sections contain additional illustrations of proof by contra¬ 
diction. 


Problems 

1.9,2. In a party with 2000 persons, among any set of four there is at least 
one person who knows each of the other three. There are three people who 
are not mutually acquainted with each other. Prove that the other 1997 
people know everyone at the party. (Assume that “knowing” is a symmetric 
relation; that is, if A knows B then B also knows A . What is the answer if 
“knowing” is not necessarily symmetric?) 

1 . 93 . Prove that there do not exist positive integers a, b, c, and n such that 
a 2 + b 2 + c 2 = 2 n abc. (From 1.4.3, we may assume that a and b are odd 
and c is even. How are the sides of the equation related to 4?) 

1 . 9 , 4 . Every pair of communities in a county are linked directly by exactly 
one mode of transportation: bus, train, or airplane. All three modes of 
transportation are used in the county; no community is served by all three 
modes, and no three communities are linked pairwise by the same mode. 

Four communities can be linked according to these stipulations in the 
following way: bus, AB, BC, CD, DA; train, AC; airplane, BD. 

(a) Give an argument to show that no community can have a single mode 
of transportation leading to each of three different communities. 

(b) Give a proof to show that five communities cannot be linked in the 
required manner. 
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1.9.5. Let S be a set of rational numbers that is closed under addition and 
multiplication (that is, whenever a and b are members of S, so are a ■+• b 
and ab ), and having the property that for every rational number r exactly 
one of the following three statements is true: r £ S, — r £ S, r = 0. 

(a) Prove that 0 does not belong to S. 

(b) Prove that all positive integers belong to S. 

(c) Prove that S is the set of all positive rational numbers. 


Additional Examples 

1.5.10, 1.6.7, 3.2.1, 3.2.6, 3.2.11, 3.2.13, 3.2.15, 3.2.17, 3.2.18, 3.3.4, 3.3.14, 
3.4.2, 4.1.3, 4.4.6, 5.4.1. Also, see Section 1.10 (Parity) and Section 1.11 
(Extreme Cases). 


1.10. Pursue Parity 


The simple idea of parity—evenness and oddness—is a powerful problem¬ 
solving concept with a wide variety of applications. We will consider some 
examples in this section, and then generalize the idea in Section 3.2. 


1.10.1. Let there be given nine lattice points in three-dimensional Euclid¬ 
ean space. Show that there is a lattice point on the interior of one of the 
line segments joining two of these points. 


Solution. There are only eight different parity patterns for the lattice points: 
(even, even, even), (even, even, odd), . . . , (odd, odd, odd). Since there are 
nine given points, two of them have the same parity pattern. Their mid¬ 
point is a lattice point, and the proof is complete. 

1.10.2. Place a knight on each square of a 7-by-7 chessboard. Is it possible 
for each knight to simultaneously make a legal move? 

Solution. Assume a chessboard is colored in the usual checkered pattern. 
The board has 49 squares; suppose 24 of them are white and 25 are black. 

Consider 25 knights which rest on the black squares. If they were to each 
make a legal move; they would have to move to 25 white squares. However, 
there are only 24 white squares available, therefore such a move cannot be 
made. 
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Figure 1.27. 


1 . 10 . 3 . Place a knight on a 4-by-n chessboard. Is it possible, in 4 n 
consecutive knight moves, to visit each square of the board and return to 
the original square? 


Solution. Before considering this problem, it is interesting to consider the 
same question for the 7-by-7 chessboard. Suppose that such a “closed tour” 
is attempted. On the first move the knight moves to a square of the opposite 
color; on the second move it returns to a square of the same color; and so 
forth. We see that after an odd number of moves the knight will occupy a 
square opposite in color from its original square. Now a closed tour of the 
7-by-7 board requires 49 moves, an odd number. Therefore the knight 
cannot occupy its original square, and the closed tour is impossible. 

Consider, now, the 4-by-n board. The argument fof the 7-by-7 does not 
carry over to this case, because 4« is an even number. To handle this case, 
color the 4-by-rt board in the manner indicated in Figure 1.27. 

Notice that knight moves made from the white squares in the top and 
bottom rows lead to white squares in the second and third rows. Con¬ 
versely, in a tour of the required type, knight moves from the inner two 
rows must necessarily be to the white squares in the outer two rows. This is 
because there are exactly n white squares in the outer two rows, and these 
can be reached only from the n white squares in the inner two rows. 
Therefore, the knight path can never move from the white squares to the 
black squares, and so such a closed tour is impossible. 

1 . 10 . 4 . Let n be an odd integer greater than 1 , and let A by an n-by-n 
symmetric matrix such that each row and each column of A consists of 
some permutation of the integers 1,. ., , «. Show that each one of the 
integers 1, . .., n must appear in the main diagonal of A. 


Solution. Off-diagonal elements occur in pairs (because A is symmetric. 
Each number appears exactly n times, and this, together with knowing that 
n is odd, implies the result. 

1 . 10 . 5 . Let a x ,a 2 , . .., a 2 „+ 1 be a set of integers with the following prop¬ 
erty (P): if any of them is removed, the remaining ones can be divided into 
two sets of n integers with equal sums. Prove that a, = a 2 = ■ ■ * — a 2n+ ,. 
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Solution. First, observe that all of the integers a h . . ., a 2n+l have the same 
parity. To see this, let A = a x + ■ ■ ■ + a 2n+l . The claim follows after 
noting that for each /, A — a t is even (otherwise the remaining numbers 
could not be divided in the required manner). 

Let a denote the smallest number among %.« 2 „ + ,, and for each i, 

let b, = a, — a. The problem is equivalent to showing that b- = 0 for all i. 

Now b l ,b 2 , . .., b 2n + 1 satisfy property (P). Since one of them is zero, it 
must be the case that they all are even. If they are not all zero, let k be the 
largest positive integer for which 2* divides each of the b>. For each i, let 
c i — b,/2 k . Then c { ,c 2 , .. ., c 2n+1 satisfy (P); however, they don’t all have 
the same parity (since one of them is zero, and another is odd because of 
the choice of k). Therefore, all the b { are zero and the proof is complete. 


Problems 

1.10.6. 

(a) Remove the lower left comer square and the upper right comer square 
from an ordinary 8-by-8 chessboard. Can the resulting board be cov¬ 
ered by 31 dominos? Assume each domino will cover exactly two 
adjacent squares of the board. 

(b) Let thirteen points Pi, ..., P 13 be given in the plane, and suppose they 

are connected by the segments PyP 2 , P 2 P 3 , • ■ •, I s d 

possible to draw a straight line which passes through the interior of 
each of these segments? 

1.10.7. 

(a) Is it possible to trace a path along the arcs of Figure 1.28(a) which 
traverses each arc once and only once? (Hint: Count the number of 
arcs coming out of each vertex.)' 

(b) Is it possible to trace a path along the lines of Figure 1.28(b) which 
passes through each juncture point once and only once? (Hint: Color 
the vertices in an alternating manner.) 

1 . 10 . 8 . Let a u a 2 , . . ., a n represent an arbitrary arrangement of the num¬ 
bers 1,2, Prove that, if n is odd, the product 

(«i - 1)(«2~ 2) • • • (a„ - n) 

is an even number. 

1.10.9. Show that (2° — 1X2* — 1) = 2 r +l is impossible in nonnegative 
integers a, b, and c. (Hint: Write the equation in the equivalent form 
2 a+b — 2“ — 2 b = 2 r and investigate the possibilities for a, b, and c.) 

1 . 10 . 10 . Show that x 2 — y 2 = a 3 always has integral solutions for x and y 
whenever a is a positive integer. 
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Figure 1.28. 


Additional Examples 

1.5.10, 1.9.1, 2.2.7, 3.2.13, 3.3.4, 3.3.20, 4.2.16(a), 4.3.4, 7.4.6. See Section 3.2 
for a generalization of this method. 


1.11. Consider Extreme Cases 

In the beginning stages of problem exploration, it is often helpful to 
consider the consequences of letting the problem parameters vary from one 
extreme value to another. In this section ,we shall see that the existence of 
extreme positions are often the key to understanding existence results 
(problems of the sort “prove there is an x such that P(x)"). 


1.11.1. Given a finite number of points in the plane, not all collinear, 
prove there is a straight line which passes through exactly two of them. 

Solution. If P is a point and L a line, let d{P, L) denote the distance from P 
to L, Let 5 denote the set of positive distances d{P, L) as P varies over the 
given points, and L varies over those lines which do not pass through P but 
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which do pass through at least two of the given points. The set S is 
nonempty (because the given points are not all collinear) and finite (there 
are only a finite number of points and a finite number of lines which pass 
through at least two such points). Therefore 5 has a minimal element, say 
d(P,M). We claim that M passes through exactly two of the given points. 

Suppose that M passes through three of the given points, say P v P 2 , and 
P 3 . Let Q denote the point on M which is closest to P. At least two of the 
points P u P 2 , Pt. lie on the same side of Q (one may equal Q), say P 2 and 
P j (see Figure 1.29). Suppose the points are labeled so that P 2 is closer to P 
than P 3 . Now let N denote the line through P and P 3 , and note that 
d(P 2i N) < d{P, M), a contradiction to our choice of P and M. It follows 
that M can only pass through two of the given points. 


1.11.2. Let A be a set of 2n points in the plane, no three of which are 
collinear. Suppose that n of them are colored red and the remaining n blue. 
Prove or disprove: There are n closed straight line segments, no two with a 
point in common, such that the endpoints of each segment are points of A 
having different colors. 

Solution. If we disregard line crossings, there are a number of ways the 
given n red points can be paired with the given blue points by n closed 
straight line segments. Assign to each such pairing the total length of all the 
line segments in the configuration. Because there are only a finite number 
of such pairings, one of these configurations will have minimal total length. 
This pairing will have no segment crossings. (If segments /?,£, and R 2 B 2 
intersected, /?,, R 2 being red points and B h B 2 blue points, then we could 
reduce the total length of the configuration by replacing these segments 
with R l B 2 and R 1 B i ,) (For another solution, see 6.2.3.) 


1.113. At a party, no boy dances with every girl, but each girl dances with 
at least one boy. Prove there are two couples bg and b'g' which dance, 
whereas b does not dance with g' nor does g dance with b'. 


Solution. Although not necessary, it may make the problem more under¬ 
standable if we interpret the problem in matrix terms. Let the rows of a 
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matrix correspond to the boys and the columns to the girls. Enter a 1 or 0 
in the b-tow and ^-column according to whether b and g dance or don’t 
dance with one another. The condition that no boy dances with every girl 
implies that (i) every row has at least one 0 entry. Similarly, (ii) every 
column has at least one 1 entry. We wish to prove that there are two rows, 
b and b', and two columns, g and g\ whose entries at their intersection 
points have the pattern 



Let h denote an arbitrary row. By (i) there is a 0 entry in this row, say in 
column k. and by (ii) there is a 1 entry in column k, say in row m\ 


k 



Now, we’re done if there is a column which contains a 1 in row h and a 0 in 
column m. In general, such a column may not exist. However, if h had been 
chosen in advance as a row with a maximal number of I’s, then such a 
column would have to exist, and the problem would be solved. 

With this background, we can rewrite the solution in language indepen¬ 
dent of the matrix interpretation. Let b be a boy who dances with a 
maximal number of girls. Let g' be a girl with whom b does not dance, and 
b' a boy with whom g' dances. Among the partners of b, there must be at 
least one girl g who does not dance with b' (for otherwise b' would have 
more partners than b). The couples bg and b'g' solve the problem. 


1.11.4, Prove that the product of n successive integers is always divisible 
by nl. 


Solution. First, notice that it suffices to prove the result for n successive 
positive integers. For the result is obviously true if one of the integers in the 
product is 0, whereas if all the integers are negative, it suffices to show that 
n! divides their absolute value. 
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So suppose there are n successive positive integers whose product is not 
divisible by «!. Of all such numbers n, choose the smallest; call it N. Note 
that N >2, since the product of two successive integers is always even. We 
are supposing, therefore, that there is a nonnegative integer m such that 
( m + IKm + 2) ■ ■ ■ (m + N) is not divisible by N\. Of all such numbers m, 
let M be the smallest. Note that M > 0, since N ! is divisible by N !. Thus, 
we are supposing that (M 4- \\M + 2) ■ • ■ (M + N) is not divisible by N !. 
Now, 


(M + \)(M + 2) ■ ■ ■ (M+ N - l)(M + N) 

= M[(M + l)(M + 2) ■ ■ ■ (M + N - 1 )] 

+ 1V[(A/+ \){M + 2) • • • (A/+ N- 1)]. 

By our choice of M, n\ divides M[{M + \)(M + 2) • • • (M + N — 1)]. By 
our choice of N, (N - 1)! divides (M + 1 ){M + 2) • ■ • (M + N - 1), and 
consequently, N! divides N[(M + 1 )(M + 2) ■ ■ ■ (M + N - 1 )]. Combin¬ 
ing, we see that N\ divides the right side of the last equation, contrary to 
our supposition. This contradiction establishes the result. 

(A slick proof of this result is to recognize that the quotient (m + \)(nt + 
2 )••*(»! + ri)/n\ is equal to the binomial coefficient and is there¬ 

fore an integer if m is an integer.) 


Problems 

1 . 11 . 5 . Let /(a;) be a polynomial of degree n with real coefficients and such 
that f(x) > 0 for every real number a. Show that f(x)+f'(x) + 

• • ■ 4- f ln -(x) > 0 for all real *. (f (k) (x) denotes the fcth derivative of 
/(*)•) 

1 . 11 . 6 . Give an example to show that the result of 1,11.1 does not 
necessarily hold for an infinite number of points in the plane. Where does 
the proof'of 1.11.1 break down for the infinite case? 

1 . 11 . 7 . Show that there exists a rational number, c/d, with d < 100, such 
that 



for Jt= 1,2,3,..., 99. 


1.11,8, Suppose that P n is a statement, for n = 1,2,3, .... Suppose fur¬ 
ther that 


(i) P t is true, and 

(ii) for each positive integer m, P m+l is true if P m is true. 
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Prove that P n is true for all n. (Hint: Let S denote the set of all positive 
integers for which P n is not true. Let m denote the smallest element in S, 
assuming that S is nonempty.) 


Additional Examples 

3.1.9, 3.3.11, 3.3.28, 4.4.7, 4,4.10, and the referrals given in 6.3.7. Also, see 
Sections 7.6 (The Squeeze Principle) and 6.2 (The Intermediate-Value 
Theorem) for examples which require consideration of “extremelike” cases. 


1.12. Generalize 

It may seem paradoxical, but it is often the case that a problem can be 
simplifed, and made more tractable and understandable, when it is general¬ 
ized. This fact of life is well appreciated by mathematicians; in fact, 
abstraction and generalization are basic characteristics of modem mathe¬ 
matics. A more general setting provides a broader perspective, strips away 
nonessential features, and provides a whole new arsenal of techniques. 

1 . 12 . 1 . Evaluate the sum 


Solution. We will instead evaluate the sum $(x) = 2* = i^ 2 ** an d then 
calculate 5( V)- The reason for introducing the variable x is that we can 
now use the techniques of analysis. We know that 


n 



1 - x n+l 

1 - x 


X# 1. 


Differentiating each side we get 


n 


2 fcc*- 1 * 
A-l 


(1 - x)(-(n + l)x") + (l - x" +1 ) 

o - *) 2 


1 -(« + l)x" + «x" +l 

(1 - X) 2 

Multiplying each side of this equation by x, differentiating a second time, 
and multiplying the result by x yields 

" , L x(l+x) — x" +l (rtx-/t — l) 2 — x" +2 

S(x) = 2 kbcK- -L —I --i —--—l -. 

k-\ (1 — X) 
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It follows that 


m 


if-6-yiT(i»-"-') 2 -^rrT 


6 - ( «L t . 4; ± 6 ). 


1.12.2. Evaluate the following determinant (Vandermonde’s determinant): 


1 

1 


det 


a, aj 
a 2 a 2 


a 


n - I 
2 


1 


I 


Solution. We will assume that a, # a p i ¥= j, for otherwise the determinant 
is zero. In order to more clearly focus on the main idea, consider the case 
n = 3: 


det 


1 

1 

l 


a a 


2 


b b 2 


c c 


2 


In this determinant, replace c by a variable jc. Then, the determinant is a 
polynomial P(x ) of degree 2. Moreover, P(a) = 0 and P(b) = 0, since the 
corresponding matrix, with c replaced by a or b respectively, then has two 
identical rows, Therefore, 


/*(x) = A (x. — — b) 


for sopie constant A. Now, A is the coefficient of x z , and, returning to the 
determinant, this coefficient is 


det 



Thus, A = b — a, and the original 3-by-3 determinant is 
/>(c) = (*-u)[(c-u)(c-6)]. 

The general case is analogous. Let D n denote the desired determinant (of 
order n). Replace a„ in the bottom row of the matrix by the variable x. The 
resulting determinant is a polynomial P„(x) of degree n — 1, which vanishes 
at oj, a 2 ,. .. Hence, by the Factor Theorem (see Section 4.2), 

**«(*) = - fliX* - a l) ‘ ’ ‘ (* “ a «-])> 
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where A is a constant. As before, A is the coefficient of x", and expanding 
along the bottom row makes it clear that A = D n _ l . That is, 

£>n - = A,-i[K - ~ a i) ■ ‘ ’ (<** ~ o»-i)]- 

We can repeat the argument for etc. The final result will be 

n TJt-l 

A, = n IIH - a d ■ 

k-2 i-l 


1.123. Given that /“(sin xJ/xdjt = 4 evaluate /®(sin 2 x)/x 2 rfx. 


Solution. We will evaluate the more general integral 

a >0, 

by using a technique called parameter differentiation. 

Differentiating each side of the previous equation with respect to a, we 
get 


r{a)= ^lmM^ax_x dx 


Jo X 

Now, with y = 2 ax, we get dy — 2a dx, and 

sin y 


Integrating each side gives 


7(u) = + C, C constant. 

Since 7(0) = 0, we get C = 0. Thus 1(a) = \ it a, a > 0. Setting a = 1 yields 
7(1) = /”(sin 2 x)jc 2 dx = \ ir. (Incidentally, the value of /“(sinx)/j <dx can 
be found by evaluating a more general integral—an integral of a complex¬ 
valued function over a contour in the complex plane.) 


Problems 

1,12.4. By setting x equal to the appropriate values in the binomial 
expansion 

< i+ *>■-&:)** 
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(or one of its derivatives, etc.) evaluate each of the following: 


<•> % k ii\ < b > J/U) 


1.12.5. Evaluate 


det 


1 

1 

1 

1 


a a 
b b 1 
c c 2 
d d 1 


a 
b 4 
c 4 
d 4 


(Replace d by a variable make use of the fact that the sum of the roots 
of a fourth-degree polynomial is equal to the coefficient of (see Section 
4.3).) 


1 . 12 . 6 . 


(a) Evaluate /"(e _ *sinjf)/jc dx. (Consider G(k) = 
and use parameter differentiation.) 

(b) Evaluate fy(x — \)/\nxdx. (Consider H(m) = (l(x m — l)/]nxdx and 
use parameter differentiation.) 

(c) Evaluate 

arctan(wjc) — arctanx 

Jo x 

(Consider F(a) = f“(arctan(o>r) — arctan x)/xdx and use parameter 
differentiation.) 

1,12.7. Which is larger ^60 or 2 + .^7? (Cubing each number leads to 
com plica tipfis that are not easily resolved. Consider instead the more 

general,problem: Which is larger, ij4(x + >-■) or %/x + tfy 3 where x, y 
> 0? Take * = a 3 ,y = b 3 .) 


Additional Examples 

1.4.2, 2.2.6, 2.2.7, 4.1.4, 5.1.3, 5.1.4, 5.1.9, 5.1.11, 5.4.4, 5.4.5, 5.4.6, 5.4.7, 

6.9.2, 7.4.4. Also, see Section 2.4 (Induction and Generalization). 



Chapter 2. Two Important Principles: 
Induction and Pigeonhole 


Mathematical propositions come in two forms: universal propositions 
which state that something is true for all values of x in some specified set, 
and existential propositions which state that something is true for some 
value of x in some specified set. The former type are expressible in the form 
“For all x (in a set 5), P(x)”; the latter type are expressible in the form 
“There exists an x (in the set 5) such thatP(x),” where P(x) is a statement 
about x. In this chapter we will consider two important techniques for 
dealing with these two kinds of statements: (i) the principle of mathemati¬ 
cal induction, for universal propositions, and (ii) the pigeonhole principle, 
for existential propositions. 


2.1. Induction: Build on P{k) 


Let a be an integer and P(n) a proposition (statement) about n for each 
integer n > a. The principle of mathematical induction states that: 

if 


(i) P{a) is true , and 

(ii) for each integer k > a, P(k) true implies P(k + 1) true , 
then P(n) is true for all integers n > a. 

Notice that the principle enables us, in two simple steps, to prove an infinite 
number of propositions (namely, P(n) is true for all integers n > a). The 
method is especially suitable when a pattern has been established (see 
Section 1.1, “Search for a Pattern”) for the first few special cases (P(a), 


CO 
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P(a + 1), P(a + 2), .. .). In this section we consider induction arguments 
which, in step (ii), proceed directly from the truth of P(k) to the truth of 
P(k + l)—that is, the truth of P(k + 1) is “built on” an initial consider¬ 
ation of the truth of P(k). This is in slight contrast to arguments (consi¬ 
dered in the next section) which begin with a consideration of P(k + 1). 


2,1.1. Use mathematical induction to prove the binomial theorem: 

(a + b)"= 2 f n .\a‘b"~‘, n a positive integer. 

;=o'' ' 


Solution. It is easy to check that the result holds when n = 1. 

Assuming the result for the integer k (we will build on the truth of P(k)), 
multiply each side by (a + b ) to get 

{a + b) k {a + b) = \ 2 I * la'**--' |(a + b) 




In the first sum, make the change of variable j = i + 1, to get 



where we have made use of the basic identity ( ; *,) + (f) = (*1 *) (see 2.5.2). 
This is the form for P(k + 1), so by induction, the proof is complete. 


2.1.2. Let 0 < a l < a 2 < • * ■ < a n , and let e, = ±1. Prove that 2"-i e A 
assumes at least ("J 1 ) distinct values as the e i range over the 2" possible 
combinations of signs. 


Solution. When n * 1, there are exactly 2 distinct values (a, and — a,), and 
(|) *= 1, so the result holds. 
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Suppose the result is true when n = k; that is, 2*- \ e , a , assumes at least 
(* 2 ') distinct values. Suppose another element a k+ , is given, with a k + l 
> a k . We must show that we can generate (*, 2 ) distinct sums. There are 
already (*\') distinct sums (generated by a ]t . . ., a k ); we need to generate 
(* 2 2 ) - (* j') = k + 1 additional sums. These can be found in the following 
manner: let S = ( so $ > 2*- i € , a , f° r all choices of ef, and note 

that S + «*+,,£ + (a k+] - a k ),S + («*+, - a k _,), . . . , S + (a k + , - a { ), 
are distinct and greater than S. There are k + 1 numbers in this list, so the 
result follows by induction. 

Mathematical induction is a method that can be tried on any problem of 
the form “Prove that P(n) holds for all n > a.” The clue is often signaled 
by the mere presence of the parameter n. But it should be noted that 
induction also applies to many problems where the quantification is over 
more general sets. For example, a proposition about all polynomials might 
be proved by inducting on the degree of the polynomial. A theorem about 
all matrices might be handled by inducting on the size of the matrix. 
Several results concerning propositions in symbolic logic are carried out by 
inducting on the number of logical connectives in the proposition. The list 
of unusual “inductive sets” could go on and on; we will be content to give 
just two examples here; other examples are scattered throughout the book 
(e.g. see the next four sections and the listings in the “Additional Exam¬ 
ples”). 


2.13. If V, E, and F are, respectively, the number of vertices, edges, and 
faces of a connected planar map, then 

V — E + F = 2. 


Solution. Your intuitive understanding of the terms in this result are 
probably accurate, but to make certain, here are the definitions. 

A network is a figure (in a plane or in space) consisting of finite, nonzero 
number of arcs, no two of which intersect except possibly at their end¬ 
points. The endpoints of these arcs are called vertices of the network. A path 
in a network is a sequence of different arcs in the network that can be 
traversed continuously without retracing any arc. A network is connected if 
every two different vertices of the network are vertices of some path in the 
network. A map is a network, together with a surface which contains the 
network. If this surface is a plane the map is called a planar map. The arcs 
of a planar map are called edges. The faces of a planar map are the regions 
that are defined by the boundaries (edges) of the map (the “ocean” is 
counted as a face). 

Figure 2.1 shows three examples of connected neworks. The first two are 
planar maps. In the first, V = 4, E = 4, F = 2; in the second, V = 5, E = 6, 
F= 3. The third network is not a planar map. However, if we should 
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flatten it onto a plane and place vertices at the intersection points, we 
would have V = 10, E = 20, F = 12. 

Now return to a consideration of the theorem. The key idea in the proof 
of this result is to realize that connected planar maps can be built from a 
single vertex by a sequence of the following constructions (each of which 
leaves the map connected): 

(i) Add a vertex in an existing edge (e.g. ■-• becomes ■—•—■). 

(ii) Add an edge from a vertex back to itself (e.g. • becomes O )* 

(iii) Add an edge between the two existing vertices (e.g. I ] becomes 

=!>■ 

(iv) Add an edge and a vertex to an existing vertex (e.g. • becomes •-•). 

We will induct on the number of steps required to construct the 
connected planar map. If the network consists of a single point, then y = 1, 
F = 1, E = 0, and V - E + F= 2. 

Suppose the result holds when k steps are required in the construction. 
The net change for each of the steps is given in the following table: 


Operation 

AV 

A E 

A F 

A( V — E + F) 

(i) 

+ 1 

+ 1 

0 

0 

(ii) 

0 

+ 1 

+ I 

0 

(iii) 

0 

+ 1 

+ 1 

0 

(iv) 

+ 1 

+ 1 

0 

0 


Since the quantity V ~ E + F remains unchanged when the (k + l)st step 
is taken, the proof is complete by induction. 


2 . 1 . 4 . Given a positive integer n and a real number x, prove that 




+ ■ 




= !«*]!• 


Solution. Although there is an integer parameter n in this problem, it will 
not work to induct on n for a fixed x. Also, of course, we cannot induct on 
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Jt, since x ranges over the real numbers (for a given real number x, there is 
no next larger real number^). Therefore, it is not clear that induction can 
be applied to this problem. 

The idea is to prove the result for all x in the subinterval [k/n,(k + 1) 
/«) for k = 0, ± 1 , ± 2, .. . . 

First, suppose x belongs to the subinterval [0,1/n). Then [ x + i/n J 
= 0 for i = 0,1, .. ., n — 1, so that 27=o [ x + i/n ] = 0. Also | nx J = 0, 
so the result is true in the “first” subinterval. 

Now suppose the result holds in the interval [(fc — 1 )jn,k/n), where k is 
a positive integer, and let x be any real number in this interval. Then 

[,l + [,+ !]+[, +2] + ... + [ J , + ' L =J.]. I „* 1 . 

By adding 1/n to x (thereby getting an arbitrary number in [k/n,(k + 1) 
/n), each of the terms, except the final term, on the left side of the previous 
equation is “shifted” one term to the right, and the final term, | x + (n — 
l)/n J, becomes |x+ 1 ], which exceeds [x ] by 1. Thus, replacing x 
by x + 1/n increases the left side of the previous equation by 1. 

At the same time, when x in [ nx ] is replaced by x + 1/n, the value is 
increased by 1. Since each side of the equation increases by 1 when x is 
replaced by x + 1/n, the result continues to hold for all numbers in the 
interval [k/n,(k + 1 )/n). 

By induction, the result is true for all positive values of x. A similar 
argument shows it is true for all negative values of x (replace x by 
x - 1/n). 

The next example is a good illustration of “building” P(k + 1) 
from P(k). 

2.1.5. If a > 0 and b > 0, then (n — \)a" + b” > na n ~ i b, n a positive 
integer, with equality if and only if a = b. 


Solution. The result is true for n = 1 ; assume the result true for the integer 
k. To build P(k + 1), we must, to get the proper left side. 


(i) multiply by a: 

(ii) add a k + i \ 

(iii) subtract b k a: 

(iv) add b k + i : 


(k - l)a* +l + b k a > ka k b, 

ka k+t + b k a> ka k b + a k + ', 
ka k + ] > ka k b + a k+ ' - b k a, 


ka k+i + b k "> ka k b + a k+1 - b k a + b k + { . 


We are assuming that this inequality is an equality if and only if a “ b. It 
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only remains to show that ka k b + a k +1 — ft^Y f> A + 1 > (k + 1 )a k b 
with equality if and only if a = b. To do this we work backwards: 

ka k b + a k +1 - b k a+ b k + i >(k+ 1 )a k b, 

-a k b + a k + ] - b k a+ b k + ' > 0, 
a k (a — b) + b k (b — a) > 0, 

{a k - b k )(a - b)> 0, 

and this is true (a - b and a k - b k have the same signs) with equality if 
and only if a = b. Thus, the proof follows by induction. (Note: This result 
is a special case of the arithmetic mean-geometric mean inequality; see 
Section 7.2.) 


Problems 

2.1.6. 

(a) Use induction to prove that 1 + 1 /\ll + 1/73 + • • • + 1 (\(n < 24n. 

(b) Use induction to prove that 2! 4! • • • (2n)! > ((« + 1)!)". 

2.1.7. The Euclidean plane is divided into regions by drawing a finite 
number of straight lines. Show that it is possible to color each of these 
regions either red or blue in such a way that no two adjacent regions have 
the same color. 

2.1.8. Prove that the equation x 2 + y 2 = z n has a solution in positive 
integers (x, y, z) for all n — 1,2,3, . .. . (For a nice proof, divide into two 
cases: even n and odd n. For a noninductive proof, see 3,5.1.) 

2.1.9. A group pP /j people play a round-robin tournament. Each game 
ends in either a win or a loss. Show that it is possible to label the players 
P 1 ,P 2 ,Pi, ■■■ , P„ in such a way that P t defeated P 2 , P 2 defeated 

P n -i defeated P n . 

2.1.10. If each person, in a group of « people, is a friend of at least half 
the people in the group, then it is possible to seat the n people in a circle so 
that everyone sits next to friends only. 

2.1.11. The following steps lead to another proof of the binomial theorem. 
We know that (a + x) n can be written as a polynomial of degree n, so there 
are constants A 0 ,A ,, .. ., A n such that 

(a + x)"=A 0 +A^ + A 2 x 2 + ■ ■ ■ +A„x”. 

(a) Use induction to describe the equation which results upon taking the 
fcth derivative of each side of this equation (A: = 1,2,..., n). 

(b) Evaluate A k for k — 0,1,..., n by setting x = 0 in the Adi equation 
found in part (a). 



64 


2. Two Important Principles: Induction and Pigeonhole 


2.1.12. Suppose that/: R -> R is a function for which f(2x — f{x)) = x for 
all x, and let r be a fixed real number. 

(a) Prove that if f(x) = x + r, then f(x - nr) = (x — nr) + r for all positive 
integers n. 

(b) Prove that if / is a one-to-one function (i.e., f(x) = f(y) implies x = y) 
then the property in (a) also holds for all integers n. 


Additional Examples: 

1.1.2, 1.1.8, 3.2.8, 6.5.13, 7.1.4. 


2.2. Induction: Set Up P(k + 1) 

In this section we consider induction arguments which begin with a direct 
assault on P(k + 1) and which work backwards to exploit the assumed 
truth of P(k). Theoretically, the arguments in this section could all be 
recast into the form of the previous section, and vice versa. However, from 
a practical standpoint, it is often much more convenient to think the one 
way rather than the other. 


2.2.1. Prove that n s /5 + n 4 /2 + n 2 /3 — «/30 is an integer for « = 0,1, 
2, . . . . 

Solution. The result is obviously true when n = 0. Assume the result holds 
for n = k. We need to prove that 

(*+l) s i (*+l ) 4 (A+l ) 3 (* + I) 

5“~ + 2 3 30 - 

is an integer. We expand, 

k 5 + 5 k 4 + 10 k 3 + 10k 2 + 5k + 1 . k 4 + 4Jt 3 + 6k 2 + 4k + 1 
5- + -2- 

, k 3 + 3 k 2 + 3& + 1 k + 1 
+ ^1-30~ 

and recombine (to make use of P{k)): 

k 5 . k 4 . k 2 k I 
T + T + T“ 30 

+ [(Jt 4 + 2 k 2 + 2 k 2 + k) + (2k 2 + 3 k 2 + 2k) + (k 2 + A)]. 

The first grouping is an integer by the inductive assumption, and the 
second grouping is an integer because it is a sum of integers. Thus, the 
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y 

proof follows by induction. (Notice how difficult it would have been to 
arrive at P(k + 1) by starting from P(k).) 


Let 

a,b. 

Pf 

Pi, ■ ■ 

■,Pn 

be real numbers with a^= b. Define f(x) 

~ x)(p 2 ~ 

~ X Xp3 ~ 

*)•' 

■ (p„ — *). Show that 


Pi 

a 

a 

a 

* * * a 

a 



b 

P2 

a 

a 

* * * a 

a 



b 

b 

Pi 

a 

* . . a 

a 


det 

b 

b 

b 

Pa 

. * . a 

a 

_ b f( a ) ~ a fi b ) 








b ~ a 


\b 

b 

b 

b 

‘ ‘ ‘ A-i 

a 



b 

b 

b 

b 

• * - b 

Pn 



Solution. This is similar to many determinant problems that can be worked 
by mathematical induction. When n = 1, we have /(*) = />, — x, and 
det(p,) = />,, and 

b K a ) - a f{ b ) _ HPi~ a )' a (Pi ~ b ) _ 

b — a b — a 

so the result holds. 

Assume the result holds for k - 1, k > I, and consider the case for k real 
numbers p lt ... ,p k . (We begin by setting up the situation for P(k) and 
plan to fall back on the truth of P(k - 1) to complete the inductive step.) 


We wish to evaluate 







/ 

/ 

Pi 

a 

a 

a 

• • a 

a 

/ 

b 

Pi 

a 

a 

a 

a 


b 

b 

Pi 

a ' ■ 

a 

a 

det 








b 

b 

b 

b 

Pk~ i 

a 


b 

b 

b 

b ■ 

• • b 

Pk t 


Subtract the second column from the first (this does not change the 
determinant): 



Pi-a 

a 

a 

a 

a 

a 


b ~ Pi 

Pi 

a 

a * * * 

a 

a 


0 

b 

Pi 

a ■ * • 

a 

a 

det 

0 

b 

b 

P4 

a 

a 


0 

b 

b 

b . • ■ 

Pk-i 

a 


0 

b 

b 

* . . . 

b 

a 
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and expand down the first column to get 



Pi 

a a a 



b p 2 • • • a a 


a)det 





b 

b ■■■ />*__, a 



b 

b - ■ b p k 



a a • • • a 

a 


b p 3 • * • a 

a 

(fr-^Jdet 




b b ■ ■■ Pk _ 

a 


b b ■■ ■ b 



The latter two determinants (on (k - l)-by-(fc - 1) matrices) are of the 
form for which we can apply the inductive assumption P(k — 1). To do 
this, we will need to introduce some notation. For the first determinant, set 
F(x) = (p 2 ~ *)(pi ~ ‘ • (pk ~ -*) and for the second, set G(x) = 

(a — x)(p 3 — x) ■ ■ ■ (p k — x). Then, by the inductive assumption, the last 
expression equals 


(Pi - a ) 


bF(a) - aF{b) 
b - a 


~( b ~P2) 


bG(a) — aG(b ) 


b — a 


But G(a) = 0 and (p t - a)F(a) = f(a), and therefore we have 

bf(a) - a( Pl - a)(p 2 - b) ■ • • (p k - b) - a(a - b)(p 2 - b) ■ • • (p k - b) 

b — a 

b f( a ) - a (P2 -b)---{p k ~ *)[(/», -a) + {a- 6 )] 

b-a 


bf(a) - af(b) 
b — a 

The result follows by induction. 


Problems 

2 . 23 . Give a proof for the inductive step in 1.1.3. 

2 . 2 . 4 . For all x in the interval 0 < x < it, prove that 

|sinnjc| < n sin jc, n a nonnegative integer. 

2 . 2 . 5 . Let Q denote the set of rational numbers. Find all functions f from 
Q to Q which satisfy the following two conditions: (i) /(1) = 2, and (ii) 
f(xy) = f(x)f(y) -f(x+y)+l for all x,y 'mQ. , 
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2.2.6. If a,b,c > 1, prove that 4 (abc + !)>(! + o)(l + 6X1 + <9- (Hint: 
Prove, more generally, that 2 n ~\a l a 1 ■ • • a„ + 1) > (1 4- a,)(l + a z ) ■ ■ ■ 
(1 + 

2.2.7. Given a set of 51 integers between 1 and 100 (inclusive), show that 
at least one member of the set must divide another member of the set. 
(Hint: Prove, more generally, that the same property will hold whenever 
n + 1 integers are chosen from the integers between 1 and 2 n (inclusive).) 
For a noninductive proof, see 2.6.1. 

2.2.8. Criticize the proof given below for the following theorem: 

An n-by-n matrix of nonnegative integers has the property that for any zero entry , 
the sum of the row plus the sum of the column containing that zero is at least n. 
Show that the sum of all elements of ike array is at least n 3 /2. 

Proof (?): The result holds for n= 1. Assume the result holds for 
n = k - 1, and consider a fc-by-fc matrix. If there are no zero entries, the 
result obviously holds. If a tJ = 0, the sum of row / and column j is at least k, 
by assumption, and the sum of the elements in the (k — 1 )-by-(A’ — 1) 
submatrix obtained by deleting row i and column j is at least (k — l) 2 /2 
(by the inductive assumption). It follows that the sum of the elements in the 
k-by-k matrix is at least (k - Xf jl + * = (k 2 - 2k + l)/2 + k = (k 2 + 1) 
/2 > k 2 / 2. The result follows by induction. 


Additional Examples 

1.1.11, 1.12.2/*lll, 4.2.21, 4.3.5“, 4.3.24, 6.5.12, 6.6.1, 7.1.6, 7.1.13, 7.2.5, 
7.3.5. 


2.3. Strong Induction 

Let o. be an integer and P(n) a proposition about n for each integer n > a. 
The strong form of mathematical induction states that: 

If 

(i) P(a) is true, and 

(ii) for each integer k > a, P(a), P{a + 1), . ,. , P(k) true implies 
P(k + 1) true, 

then P(n) is true for all integers n > a. 

This differs from the previous induction principle in that we are allowed 
a stronger assumption in step (ii), namely, we may assume P(a), P{a + 
1), . . ., P(k), instead of only P{k), to prove P(k + 1), Theoretically, the 
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two forms of induction are equivalent, but in practice there are problems 
which are more easily worked with this stronger form. 

2.3.1 (Pick’s theorem). Prove that the area of a simple lattice polygon (a 
polygon with lattice points as vertices whose sides do not cross) is given by 
I + jB — 1, where I and B denote respectively the number of interior and 
boundary lattice points of the polygon. 

Solution. We will induct on the number of sides in the polygon. The case of 
a triangle is given in 1.7.3. Consider, then, a simple lattice polygon P with k 
sides, k > 3. We first establish that such a polygon has an interior diagonal. 
This is clear if the polygon in convex (equivalently, if all the interior angles 
are less than 180°). So suppose the interior angle at some vertex, say V, is 
more than 180°. Then a ray emanating from V and sweeping the interior of 
the polygon must strike another vertex (otherwise the polygon encloses an 
infinite area), and this determines an interior diagonal D with V as one 
endpoint. 

Suppose that our polygon P has 1 interior points and B boundary points. 
The interior diagonal D divides P into two simple lattice polygons P t and 
P 2 with 1 1 and I 2 interior points respectively, and B ] and B 2 boundary 
points respectively. Suppose there are x lattice points on D, excluding its 
endpoints. Then B = B r + B 2 — 2 — 2x, and / = /, + I 2 + x. 

Now, let A, A and A 2 denote the areas of P, /*„ and P 2 respectively. 
Then 

A =Ai + A 2 

= (I l + lB ] -\) + (I 2 + iB 1 -l) 

= { I \ + h ) + 2(^1 + B i ) " 2 
= (/| + I 2 + x) + ^(Bj + B 2 — 2x) — 2 
= I + i(B + 2)-2 
= I+±B~ 1. 

The result follows by induction. 

Notice in this example that it is the first step of the induction argument 
which is the most difficult (done in 1.7.3); the inductive step (step (ii)) is 
conceptionally very simple. 


Problems 

2.3.2. 

(a) Prove that every positive integer greater than one may be written as a 
product of prime numbers. 

(b) Bertrand’s postulate, once a postulate but now a known theorem, states 
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that for every number x > 1, there is a prime number between x and 
2x. Use this fact to show that every positive integer can be written as a 
sum of distinct primes. (For this result, assume that one is a prime.) 

233 . 

(a) Show that every positive integer can be written as a sum of distinct 
Fibonacci numbers. 

(b) Let k » m mean that k > m + 2. Show that every positive integer n has 

a representation of the form n = F ki + F ki + • • • + F k , where F k . are 
Fibonacci numbers and k\ » k 2 k r 3> 0. 

(c) Show that the representation in part (b) is unique. 


Additional Examples 
3.1.1. 3.1.2, 3.1.18,3.5.5, 6.2.3. 

2.4. Induction and Generalization 


We have seen (in Section 1.12) that a problem is sometimes easier to handle 
when it is recast into a more general form. This is true also in induction 
problems. For example, it may happen that the original propositions 
/’(l), P(2), P(3), ... do not contain enough information to enable one to 
carry out the inductive step (step (ii)). In this case it is natural to reformu¬ 
late the propositions into a stronger, more general form 0(1), 0(2), . . . 
(where Q(n) implies P(n) for each n), and to look again for an inductive 
proof. 

2.4.1. If A | + • * ■ + A n = v, 0 < A t < 7T, i = 1, . .. , «, then 
sin >4. + * ■ • + sin^„ < n sin — . 

I n n 

Solution. Let P(k) be the statement of the theorem for a given k, and 
suppose P(k) is true. For the inductive step, suppose A , + * * ■ + A k + 
A k + 1 = 7T, 0 < A t < n, t = 1, . .. , k + 1. In this form, it is not clear how to 
make use of P(k). We might, for example, group A k and A k + ] together, so 
that Aj + • • • + A k _ l + ( A k + A k+] ) = it, and then apply the inductive 
assumption to get 

sin^, + - • • + sin^ t _, + sin(^ A + A k+ ,) < k sin ^ , 

But now it is not at all clear that this implies P(k + 1): 

sin^, + • • ■ + sin.4* + sin^ ft+l < (fc + l)sin + . 
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The requirement that the ^,’s add to it seems too restrictive. Consider 
instead the following proposition Q(n ): 

If 0 < A t < n, i = 1, . . ., n, then 


sin A | + * 


+ sin/J„ < 


.■J A i + •“ A *\ 
” Sin (- » -)■ 


(Note that Q(n) implies /’(«).) Obviously, 0(1) is true. Suppose that 
Q(k ) is true, and suppose that 0 < A I : < 57, / = 1, . . ., k + 1. Then 


sin/4[ + * • * + sin^ + sin/4* + , 


, / A. + ■ ■ * + A t \ . j 

< Asm^-£-j + nnA k+l 

~ <* + '>[ *TT sin ( - ' + ’^ A ‘ ) + ^ , 

*<* + ‘>[ !in (kh ( ) + J-La,,,)! 

f' -)- 

(The inequality in the next to last step follows from the result in 1.2.12(b).) 
The result now follows by induction. 

We are now able to prove the conjecture made in 1.6.2(e): The polygon 
of greatest area that can be inscribed in a circle is the regular polygon. To 
do this, suppose that P i ,P 2 ,. . ., P„, n > 3, are the successive vertices of an 
inscribed polygon (inscribed in a circle of radius r ), Let O denote the center 
of the circle; let 7] denote the area of triangle F ( f?F ; + i , i = 1, . . . , n (we set 
P „ + , * T,); let A t = L P,OP, + , (Figure 2.2). Then 

T < = 2[i(rcos^,)(rsint4)] 

= /■ 2 cos^/4 ( sinT / 4 ; . 

= i/^sin^,.. 



Figure 2.2. 
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The polygon of maximal area must satisfy 0 < A t < tt for each /, Thus, 
our preceding result shows that 

n 

Area of polygon = 2 ^ 

/«i 


= ^ i/- 2 sini4 ( = if 1 2 sin/4 ; 

i=i ;=i 


<S ~ r 2 sin 


il A , 

71 J-l j 


= n 


-Lr 
2 ' 


Hr ) 


The right-hand side is the area of a regular n-gon, and this completes the 
proof. 


2.4.2. Let f(x) — (x 2 — l) 1 ^ 2 , x > 1. Prove that /^’(x) > 0 for odd n and 
/ (n) (x) < 0 for even n. 


Solution. We might expect to be able to express f k + l) (x) in terms of 
f ki (x). But a look at the first few derivatives makes this plan appear 
hopeless: 


/'(*) “ 


(x 2 -l) 


1/2 


/'(*) = - 


(* 2 -l) 


3/2 


r\x) = 

/ (v> (*)= 


3x 


5/2 


(X 2 -l) 

60x 3 + 3lx 
(x 2 - if 1 


/ (iv V) = 

r\ X )= 


] 2 x 2 + 1 
(x 2 - 1) 7/2 ’ 

522x 4 + 266x 2 + 31 


(x 2 -l) 


11/2 


Consider instead the following reformulation: If f(x) = (x 2 - l) 1 ^ 2 , 
x > 1, then 


f"Xx) = 


&(*) 


(x 2 -l) 


(2»— l)/2 


where g„(x) is a polynomial of degree n — 2, and 


«.(*) is 


an odd function all of whose 
coefficients are nonnegative 
an even function all of whose 
coefficients are nonpositive 


if n is odd, 
if n is even. 


This proposition can be established by induction (we omit the messy 
details), and this implies the original result. 
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2.43. Let Fj denote the ith term in the Fibonacci sequence. Prove that 
F» 2 +l + F; = F 2 „ +l . 

Solution. The result holds for n = 1, so suppose the result holds for the 
integer k. Then 

Fi +2 + Fi + ,={F k + l + F k f+ 

= F U 1 + 2 F k+ ,F k + Ft + Ft + 1 

= ( Ft + , + /?) + ( 2 F k+ ,F k + F k \,) 

= F 2k + t + ( 2F k+\ F k + F k + l)> 

the last step by the inductive assumption. 

We would be done if we could show 2 F k + l F k + F k + I — F 2k + 2> for we 
could then continue the previous argument, F lk + , + (2F k + t F k + Ft +i ) 
= F 2 k +1 + F 2 k +2 = F 2 k + 3 , and this completes the inductive step. There¬ 
fore, it remains to prove that 2 F k + 1 F k + F k + 1 = F 2k + 2- We proceed by 
induction. It is true for n = 1, and assuming it true for k, we have 

2 F k+2 F k+ , + F t +2 - 2 (F k+l + F k )F t+ , + F* 2 +2 

= 2/- 2 +i + 2F* +i F* + F* 2 +2 

= {2F k+ t F k + F k + ,) + ( F t + 1 + Ft +1 ) 

= F 2 k + 2 + ( F k + \ + F k + 2 )- 

But now we are back to the earlier problem: does P k+2 + F t +1 = F i k + 3 ^ If 
so, *i*+i+ (** + ! + F k + 2 )= F 2 k + 2 + F 2 k + 3 = F 2 k+* and the induction is 
complete. Thus, the problems are interrelated: the truth of the first depends 
upon the truth of the second, and conversely, the truth of the second 
depends upon the truth of the first. 

We can resolve the difficulty by proving them both in the following 
manner. Consider the two propositions 

P(*y- F n + \ + F n = 

Q(n )= 2 F n + I F„ + Ft + , = F 2a +2 • 

P(l) and 0(1) are each true. The previous arguments show that P(k) and 
Q(k) imply P(k -f 1), and that P(k + 1) and Q(k ) imply Q(k + 1). It 
follows that P(k) and Q(k) imply P(k + 1) and Q(k + 1), and the proof is 
complete. 


2.4.4. Let f(x) = o^injc + fl 2 sin2je + ■ • ■ + u„sinnjc, where a,, . . ., a n 
are real numbers and where n is a positive integer. Given that |/(x)| 
< jsinje| for all real x, prove that |a, + 2a 2 + ■ • ■ + na„\ < 1. 
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Solution. Suppose we try inducting on the number of terms in /(*). When 
n = \, f(x) = a t sin*, and since J/(jr)J < |sin*j, it follows that |tz,| = 
|<j,sin(w/2)| = |/(w/2)| < jsin(sr/2)| = 1. 

Suppose the result holds for k , and consider the function 

/(*) = a, sin* + a 2 sin2* + • ■ • + u^sin/:* + a k+ ,sin(A: + 1)*, 

for some choice of real numbers a } ,a 2 , . .., a k+l , and suppose that /(•*)! 
< jsih*| for all real *. Since sin(/c + 1)* = sin A;* cos* + sin* cosfcx, we 
can write 

/(*) = (a, + o* + |Cos&*)sin* + o 2 sin2* +••• 

+ a k - |Sin(/t — 1)* + ( a k + n it + l cos*)sinfc*. 

We have now rewritten /(*) as a sum of k terms, more or less of the type 
from which we can apply the induction assumption. The difficulty is that 
the coefficients of the sine terms in this expression are not constants; rather 
they contain functions of *. This suggests considering the following more 
general problem. 

Let «](*), .... a„(x) be differentiable functions of *, and let /(-*) 
= a|(*)sin* + a 2 (*)sin2* + * • - + a n (*)sin nx. Given that j/(*)| < |sin*| 
for all real *, prove that 

K(°) + 2u 2 (0) + ■ • ■ +na„(0)| < 1. 

If we can prove this proposition, we will have solved the original 
problem also, because, taking a,(x) = a t , a, a constant, i = 1,2,. . ., ft, for 
all *, we recover the original problem. 

Again we proceed by induction. We are given |u](*)sin*| < |sin*j. As * 
approaches 0, sin* ^ 0, so that for these *, |«,(*)| < 1. Since a ,(*) is 
continuous at * = 0, it follows that |z7 t (0)| < 1. This implies that the result is 
true for the case n = 1. 

Now suppose the result is true for n = k, and consider the function 

/(*) = ari*) 5 * 11 * + « 2 (*)sin2* + • • * + jfjcJsin (At + 1)*, 

where |/(*)| < |sin*| and n,(*) are differentiable. As before, this can be 
rewritten in the equivalent form 

/(*) = [- a k+ ,(*)cosfc*]sin* -f « 2 (*)sin2* + ■ • ■ 

+ a k - ,(*)sin(fc - 1)* + [«*(*) + a k + ,(*)cos*]sin kx. 

We may now apply the inductive assumption, and conclude that 

l[«i(0) + «* + .(0)] + 2a 2 (0)+ ■■ • 

+ (*“ 1 K-i(°) + *[^(0) + ^+i( 0 )]l < 1- 

But this is the same as 

K(0) + 2u 2 {0) + - - - + ka k (0) + (k+ 1H + ,(0)| < 1. 

which is the desired form. (A noninductive proof of this result is given in 
6.3.2.) 
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Problems 

2.4.5. Let S denote an «-by-n lattice square, n > 3. Show that it is possible 
to draw a polygonal path consisting of 2n — 2 segments which will pass 
through all of the n 2 lattice points of S. 

2.4.6. Let / 0 (x) = 1/(1 - *), and define / rt+ ,(*) = xf'(x). Prove that 
/„+ ,(*) > 0 for 0 < x < 1. 


2.5. Recursion 


In the second solution to 1.1.1, we let A n denote the number of subsets of a 
set with n elements. We showed that A n+1 —2A n , A 0 = 1. This is an 
example of a recurrence relation. Even though we do not have an explicit 
formula for A n (as the method of induction requires), the recurrence 
relation defines a “loop” or algorithm which shows us how to compute 
A „ + !- In this section we look at problems that can be reduced to equivalent 
problems with smaller parameters. The idea is to apply the reduction 
argument recursively until the parameters reach values for which the prob¬ 
lem can be solved. 

2.5.1 (Tower-of-Hanoi problem). Suppose ft rings, with different outside 
diameters, are slipped onto an upright peg, the largest on the bottom, to 
form a pyramid (Figure 2.3). Two other upright pegs are placed sufficiently 
far apart. We wish to transfer all the rings, one at a time, to the second peg 
to form an identical pyramid. During the transfers, we are not permitted to 
place a larger ring on a smaller one (this necessitates using the third peg). 
What is the smallest number of moves necessary to complete the transfer? 


Solution. Let denote the minimal number of moves for a stack of n 

rings. Clearly = 1, so suppose n > 1. In order to get the largest ring on 

the bottom of the second peg, it is necessary to move the topmost « — 1 
rings to the third peg. This will take a minimum of Af„_, moves (by our 
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Figure 2.4. 

choice of notation). One move is necessary to transfer the largest ring to the 
second peg, and then M n _, moves are necessary to transfer the n — 1 rings 
to the second peg. Thus 

A/ n = 2A/ n _, + l, M, = 1. 

An easy induction, based on this recurrence, shows that M n = 2" +1 - 1 
(M„ +l = 2M„ + 1 = 2[2" +l - 1] + 1 = 2" +2 — 1). 

Let a lt a 2 , ■ ■ ■ , be a permutation of 1,2, . . ., n. We can interpret this 
permutation geometrically in the following way. Take an n by n chess¬ 
board, and for each i, place a rook in the i'th column (from the left) and the 
n ( th row (from the bottom). For example, the permutation 3,2,5,4,1 is 
represented in Figure 2.4. In this way we see that a permutation of 
1 ,2,. . ., n corresponds to a placement of n “nonattacking” rooks on the n 
by n chessboard. This correspondence enables one to think of permutations 
geometrically and to use the language and imagery of nonattacking rooks 
on a chessboard. 

2 . 5 . 2 . Let Q n denote the number of ways of placing n nonattacking rooks 
on the n-by-M chessboard so that the arrangement is symmetric about the 
diagonal from the lower left corner to the upper right comer. Show that 

&,= Qn-i +(« ” 1 )Q n -2- 

Solution. A rook in the first column may or may not occupy the square in 
the lower left comer of the board. If it does, there are £>„_, ways of placing 
the remaining n — 1 rooks. If it doesn’t, it can occupy any n — 1 squares in 
the first column. Once it is placed, it uniquely determines the location of a 
symmetrically placed rook (symmetric with respect to the given diagonal) in 
the first row. The remaining n — 2 rooks can be placed in Q„_ 2 ways. 
Putting these ideas together gives the result. 


2.5.3. A coin is tossed n times. What is the probability that two heads will 
turn up in succession somewhere in the sequence of throws? 
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Solution. Let P n denote the probability that two consecutive heads do not 
appear in n throws. Clearly P t = 1, p 2 = |. If n > 2, there are two cases. 

If the first throw is tails, then two consecutive heads will not appear in 
the remaining n — 1 tosses with probability P„_ , (by our choice of nota¬ 
tion). If the first throw is heads, the second toss must be tails to avoid two 
consecutive heads, and then two consecutive heads will not appear in the 
remaining n — 2 throws with probability P n ^ 2 - Thus, 

P* = 2 — I "b 4 Pn - 2 > n > 2. 

This recurrence can be transformed to a more familiar form by multiply¬ 
ing each side by 2": 

2 n P n = 2 + 2"~ 2 P „~ 2 , 
and setting S n = 2 "P n for each n: 

$n = I + $n-2 ■ 

This is the recurrence for the Fibonacci sequence (note that S n = F n+ 2 ). 
Thus, the probability we seek is Q„ = 1 — P n = 1 — F n+2 /2". 

The next example doesn’t lead to an explicit recursive formula, but it 
illustrates the “working backward” thinking that is characteristic of the 
recursive concept. 


2.5.4. Prove that any positive rational number can be expressed as a finite 
sum of distinct terms of the harmonic series. 


Solution. Let m/n be any positive rational. Then 

* .1 + 1 + ... +1 

n n n n 

is a sum of harmonic terms with n — 1 duplications. Recursively expand all 
duplicates by the identity 1/m = 1 f{n + 1) + 1 j n{n + 1) until all terms are 
distinct. 


Problems 

2.5.5. Let P n denote the number of regions formed when n lines are drawn 
in the Euclidean plane in such a way that no three are concurrent and no 
two are parallel. Show that P n+ , = P n + (m + I). 

2.5.6. 

(a) Let E„ denote the determinant of the M-by-n matrix having — l’s below 
the main diagonal (from upper left to lower right) and l’s on and above 
the main diagonal. Show that E, = 1 and E„ = 2E „_, for n > 1. 
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(b) Let D„ denote the determinant of the rt-by-n matrix whose (/, /)th 
element (the element in the /th row and yth column) is the absolute 
value of the difference of i and j. Show that D n = (— 1)"“'(« — 1)2 "~ 2 . 

(c) Let F n denote the determinant of the n-by-n matrix with a on the main 
diagonal, b on the superdiagonal (the diagonal immediately above the 
rhain diagonal—having n — 1 entries), and c on the subdiagonal (the 
diagonal immediately below the main diagonal—having n — 1 entries). 
Show that F n = aF n _, - bcF n _ 2 , n>2. What happens when a = b = 1 
and c = — 1? 

(d) Evaluate the n-by-n determinant A tt whose (/,/)th entry is by 
finding a recursive relationship between A„ and A„_,. 

2.5.7. 

(a) Let a x ,a 2 , ■ ■ ■ be positive real numbers and A n = (a t ■ + a n ) 
jn. Show that A n > A { n ”_~j bi« a V« w ith equality if and only if A „_, = a n . 
(Hint: Apply the inequality of 2.1.5.) 

(b) Aritkmetic-mean-geometric-mean inequality. Using part (a), show that 


with equality if and only if a t = a 2 = ■ • ■ = a n . 

2.5.8. Two ping pong players, A and B, agree to play several games. The 
players are evenly matched; suppose, however, that whoever serves first has 
probability P of winning that game (this may be player A in one game, or 
player B in another). Suppose A serves first in the first game, but thereafter 
the loser serves first. Let P„ denote the probability that A wins the nth 
game. Show that E n + I = P„(\ - P) + (1 - P„)P. 

2.5.9. A gambling student tosses a fair coin and scores one point for each 
head that turns up and two points for each tail. Prove that the probability 
of the student scoring exactly n points at some time in a sequence of n 
tosses is ^[2 + (-^)"]. (Hint: Let P„ denote the probability of scoring 
exactly n points at some time. Express P n in terms of P„_ t , or in terms of 
P„_ i and P„_ 2 . Use this recurrence relation to give an inductive proof.) 

2.5.10 (Josephus problem). Arrange the numbers 1,2 , ,n consecu¬ 
tively (say, clockwise) about the circumference of a circle. Now, remove 
number 2 and proceed clockwise by removing every other number, among 
those that remain, until only one number is left. (Thus, for n = 5, numbers 
are removed in the order 2, 4, 1, 5, and 3 remains alone.) Let /(«) denote 
the final number which remains. Show that 

f(2n) = 2 f(n) - 1, 

/(2n + 1) = 2/(«) + 1. 

(This problem is continued in 3.4.5.) 
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2.5.11. 

(a) Let R„ denote the number of ways of placing n nonattacking rooks on 
the n-by-n chessboard so that the arrangement is symmetric about a 
90° clockwise rotation of the board about the center. Show that 

H*. = (4»-2)1(4. 

^4n+ 1 = > 

^4n + 2 = 0 = ^4n+3 • 

(b) Let S n denote the number of ways of placing n nonattacking rooks on 
the n-by-n chessboard so that the arrangement is symmetric about the 
center of the board. Show that 

= 2nS 2 „-2 > 

$2n + I ~ S 2n ■ 

(c) Let T„ denote the number of ways of placing n nonattacking rooks on 
the n-by-n chessboard so that the arrangement is symmetric about both 
diagonals. Show that 

S 2 = 2, 

2n + I > 

^2n = ^^2n-2 + (2» — 2)5 2 „_ 4 . 

2.5.12. A regular 2n-gon is inscribed in a circle. Let T n denote the number 
of ways it is possible to join its vertices in pairs so that the resulting 
segments do not intersect one another. If we set T a = 1, show that 

T n = T 0 T„_ i + TJ„_ 2 + T 2 T„_3+ ■■ • + T„. t T a . 

(For a continuation of this problem, see 5.4.10.) 

2.5.13. Let at,, a 2 , ■ ■ ■ ,a„ be a permutation of the set S„ = { 1,2, . . ., «}. 
An element i in S„ is called a fixed point of this permutation if a i — i. 

(a) A derangement of S n is a permutation of £„ having no fixed points. Let 
g„ be the number of derangements of S„. Show that 

S\ = Si= l 

and 

Sn = {*- 1K&-I +^- 2 ). ^ n>2. 

(Hint: a derangement either interchanges the first element with another 
or it doesn’t.) 

(b) Let f„ be the number of permutations of £„ with exactly one fixed 
point. Show that |/„ — g„\ = 1. 

2.5.14. Suppose n men check in their hats as they arrive for dinner. As 
they leave, the hats are given back in a random order. What is the 
probability that no man gets back his own hat? (Hint: Let p„ denote this 
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probability. Then p n = g„/n\, where g n is as in 2.5.13. Let C n = p n — 

Use the recurrence relation found in 2.5.13(a) to show that C> = C„ = 
— C„_ i /n. Use this to show that p„= 1/2! — 1/3! + - - ■ +(—1)"/«!. 
Then for large n, p n zz l/e.) 

2.5.15. 

(a) Let 1„ = /J| /2 sin"jt dx. Find a recurrence relation for I n . 

(b) Show that 

1 x 3 x 5 x ■ - ■ x (2« - 1) „ 

2n 2 X 4 X 6 X * * * X 2n ' 2 ' 

(c) Show that 

2 X 4 X 6 X • • • x (2n - 2) 

/2fl + l = 1 X 3 X5 X * • • X (2n — 1) ' 


Additional Examples 

1.1.1 (Solution 2), 4.3.9, 5.3.5, 5.3.14, 5.3.15, 5.4.8, 5.4.9, 5.4.24, 5.4.25, 
5.4.26. Closely related to induction and recursion are arguments based on 
“repeated arguments”. Examples of what is meant here are 4.4.4, 4.4.17, the 
proof of the intermediate-value theorem in 6.1, 6.1.5, 6.1.6, 6.3.6, 6.8.10, 
and the heuristic for the arithmetic-mean-geometric-mean inequality given 
in Section 7.2. 


2.6. Pigeonhole Principle 


When a sufficiently large collecdon of objects is divided into a sufficiently 
small number of classes, one of the classes will contain a certain minimum 
number of objects. This is made more precise in the following self-evident 
proposition: 

Pigeonhole Principle. If krt + 1 objects (k > 1) are distributed among n boxes, one 
of the boxes will contain at least k + 1 objects. 

This principle, even when k = 1, is a very powerful tool for proving 
existence theorems. It takes some experience, however, to recognize when 
and how to use it. 

2.6.1. Given a set of n + 1 positive integers, none of which exceeds 2 n, 
show that at least one member of the set must divide another member of 
the set. 
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Solution. This is the same as 2.2.7, where it was done by induction on n,i 
However, the problem is really an existence problem for a given n, and it 
can be carried out very nicely by the pigeonhole principle, as we shall see. 

Let the chosen numbers be denoted by x t ,x 2 , . .., x n+ ,, and for each r, 
write x i = 2”y j , where n t is nonnegative integer and y, is odd. Let T = 
{y -: r = 1,2, . .. , « + 1}. Then T is a collection of « + 1 odd integers, each 
less than 2 n. Since there are only n odd numbers less than 2«, the 
pigeonhole principle implies that two numbers in T are equal, say y, = y, 
i < j. Then 

Xi =2'\v, and x j = 2 n *y i . 

If n : < then x ; divides xx if n i > n , then x- divides x r This completes 
the proof. 


2.6.2, Consider any five points P h P 2 , P 2 , P 4 , P\ in the interior of a square 
S of side length 1. Denote by d i} the distance between the points P t and Pj. 
Prove that at least one of the distances d tJ is less than 72 /2. 


Solution. Divide S into four congruent squares as shown in Figure 2.5. By 
the pigeonhole principle, two points belong to one of these squares (a point 
on the boundary of two smaller squares can be claimed by both squares). 
The distance between these points is less than ^2/2. 


2,63, Suppose that each square of a 4-by-7 chessboard, as shown below, is 
colored either black or white. Prove that in any such coloring, the board 
must contain a rectangle (formed by the horizontal and vertical lines of the 
board), such as the one outlined in the Figure 2.6, whose distinct comer 
squares are all the same color. 

Solution. Such a rectangle exists even on a 3-by-7 board. The color 
configurations of the columns each must be of one of the types shown in 
Figure 2.7. 


Figure 2.5. 


/ 
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Figure 2.6. 



Figure 2.7. 

Suppose one of the columns is of type 1. We are done if any of the 
remaining six columns are of type 1, 2, 3, or 4. So suppose each of the other 
columns is of type 5, 6, 7, or 8. Then, by the pigeonhole principle, two of 
these six columns must have the same type and we are done. 

The same argument applies if one of the columns is of type 8. 

So suppose none of the columns are of type 1 or type 8. Then we have 
seven columns but only six types. By the pigeonhole principle, two columns 
have the same type and the proof is complete. 


2.6.4. Prove that there exist integers a,b,c not all zero and each of 
absolute value less than one million, such that 

\a + bjl + cfi\ < 10" “. 


Solution. Let S be the set of 10 18 real numbers r + s-j2 + fV3 with each of 
r,s,t in {0,1,2, . .. , 10 6 - 1}, and let d = (1 +^2 + VTJIO 6 . Then each x in 
S is in the interval 0 < x < d. Partition this interval into 10 18 — 1 equal 
subintervals, each of length e = d/{\ 0 18 — 1). By the pigeonhole principle, 
two of the 10 18 numbers of S must be in the same subinterval. Their 
difference, a 4 - b-fl +c^3, gives the desired a,b,c, since e < 10 7 / 10 ls 
- 10 “ u . 


2.6.5. Given any set of ten natural numbers between 1 and 99 inclusive 
(decimal notation), prove that there are two disjoint nonempty subsets of 
the set with equal sums of their elements. 
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,, 

v‘' 10 0 

Solution. With the chosen set of ten numbers, we can form 2 — 1 ~ 1023 
(different) nonempty subsets. Each of these subsets has a sum smaller than 
1000, since even 90 + 91 + ■ • * + 99 < 1000, Therefore, by the pigeonhole 
principle, two subsets A and B must have the same sum. By throwing away 
the elements which belong to both sets we obtain two disjoint sets X = A — 
Ar\B,Y=B-ACiB, with the same sum. (Neither X nor Y is empty, for 
this would mean that either A c B or B c A, which is impossible, since 
their elements add to the same number.) 


Problems 

2.6.6. Let A be any set of 20 distinct integers chosen from the arithmetic 
progression 1,4,7, . . . , 100. Prove that there must be two distinct integers 
in A whose sum is 104. 

2.6.7. 

(a) Let S be a square region (in the plane) of side length 2 inches. Show 
that among any nine points in S, there are three which are the vertices 
of a triangle of area < ± square inch. 

(b) Nineteen darts are thrown onto a dartboard which has the shape of a 
regular hexagon with side length one foot. Show that two darts are 
within Vi/3 feet of each other. 

2.6.8. Show that if there are n people at a party, then two of them know 
the same number of people (among those present). 

2.6.9. Fifteen chairs are evenly placed around a circular table on which 
are name cards for fifteen guests. The guests fail to notice these cards until 
after they have sat down, and it turns out that no one is sitting in front of 
his own card. Prove that the table can be rotated so that at least two of the 
guests are simultaneously correctly seated. 

2.6.10. Let X be any real number. Prove that among the numbers 

X,2X, ...,(n-l)X 

there is one that differs from an integer by at most 1/n. 

2 . 6 . 11 . 


(a) Prove that in any group of six people there are either three mutual 
friends or three mutual strangers. (Hint: Represent the people by the 
vertices of a regular hexagon. Connect two vertices with a red line 
segment if the couple represented by these vertices are friends; other¬ 
wise connect them with a blue line segment. Consider one of the 
vertices, say A . At least three line segments emanating from A have the 
same color. There are two cases to consider.) 
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(b) Seventeen people correspond by mail with one another—each one with 
all the rest. In their letters only three topics are discussed. Each pair of 
correspondents deals with only one of the topics. Prove that there are at 
least three people who write to each other about the same topic. 

2.6.12. Prove that no seven positive integers, not exceeding 24, can have 
sums of all subsets different. 


Additional Examples 

1.10.1, 3.2.1, 3.2.5, 3.2.19, 3.2.20, 3.3.24, 4.4.10. 



Chapter 3. Arithmetic 


In this chapter we consider problem-solving methods that are important in 
solving arithmetic problems. Perhaps the most basic technique is based on 
the fundamental theorem of arithmetic, which states that every integer can 
be written uniquely as a product of primes. The theoretical background 
necessary for the proof of this key theorem requires a discussion of the 
notion of divisibility. Therefore, we will begin the chapter by considering 
problems about greatest common divisors and least common multiples. 
Important to this understanding are the division algorithm and the Euclid¬ 
ean algorithm. 

In the second section we introduce the technique of modular arithmetic 
(a generalization of the notion of parity), and see in it an efficient and 
effective method for many problems concerned with relationships between 
integers. In the last two sections we are again reminded of the importance 
of notation in solving problems, and we consider problems related to the 
representation of numbers: the positional notation for integers, and the 
rectangular, polar, and exponential notations for representing complex 
numbers. 


3,1. Greatest Common Divisor 


Given integers a and b, we say that a divides b, and we write a \ b, if there is 
an integer q such that b — qa. On the basis of this definition it is easy to 
prove the following very useful result: If n divides two of the terms in the 
expression a = b + c, then n divides all three of the terms. (Note: In this 
chapter, unless otherwise stated, all variables are integer variables.) 
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If a l3 ... ,a„ are given integers, we will denote their greatest com¬ 
mon divisor by gcdfo,, .. ., a n ), and their least common multiple by 
lcm(a,, . . ., a„). 


3.1.1. Find all functions/which satisfy the three conditions 

(i) f(x,x) = a, 

(ii) f(x,y)=f(y,x), 

(iii) f(x, y)=f(x,x + y), 

assuming that the variables and the values of / are positive integers. 


Solution. A look at special cases leads us to suspect that /(a, y) = gcd(v, y). 
We will prove this by inducting on the sum x + y. 

The smallest value for x +y is 2, and this occurs when x = >’ = 1- By (i), 
/(1,1) = 1, and also gcd(l, I) = 1, so our supposition is confirmed in this 
case. 

Suppose that x and y are positive integers such that x + y = k > 2, and 
suppose the claim has been shown for ail smaller sums. By (i) and (ii), there 
is no loss in generality in supposing that x<y. By (iii), f(x,y) = 
f(x, x + (y — a')) = f(x, y - a). But by the inductive assumption, f(x, y - 
a) = gcd(jr, y — a). The proof will be complete if we can show that 
gcd(A, y - a) « gcd(*,_y). 

If c|a and c\y, then c | a and c \y - a. It follows that gcd(A. y) 
< gcd(A, y — a). Similarly, if c | a and c\y — x, then c|a and c |_v, and 
therefore gcd(A, y — x) < gcd(A, y). Putting these together, it must be the 
case that gcd(A, >> — a) = gcd(.v, _>>), and the proof is complete. 

The following result rests at the very foundation of number theory. 

Division Algorithm. // a and b are arbitrary integers, b > 0, there are unique 

integers q and r such that 

a = qb + r, 0 < r < b. 

By repeated use of the division algorithm we can compute the greatest 
common divisor of two integers. To see how this goes, suppose that 6, and 
b 2 are positive integers, with b x > b 2 . By the division algorithm there are 
integers q and b 3 such that 

bi = qb 2 + h 3 , 0 < b 3 < b 2 . 

It is easy to check, using this equation, that gcd (b l ,b 2 ) = gcd (b 2 ,b 3 ). 

If b 3 = 0 f then gcdffcj, b 2 ) = b 2 ■ If b 3 > 0, we can repeat the procedure, 
using b 2 and b 3 instead of b t and b 2 , to produce an integer b A such that 
gcd(h 2 , b 3 ) = gcd(b 3 ,b 4 ), b 3 > b 4 > 0. 
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By continuing in this way, we will generate a decreasing sequence of 
non negative integers 

b t > b 2 > b 3 > 

such that gcdf*,,^) = gcd(6 2 ,£ 3 ) = • • * = gcd(b h b i+} ), i = 1,2,3, . .. . 
Since such a sequence cannot decrease indefinitely, there will be a first n 
such that b n+1 = 0. At this point gcd(6,,6 2 ) = gcd (b n ,b n+ ,) = . 

This procedure for finding gcd (b 1 ,b 2 ) is called the Euclidean algorithm. 
Before giving an example of this algorithm, we will state and prove the 
major result of this section. 


3.1.2. Given positive integers a and b, there are integers s and / such that 

sa + tb = gcd (a,b). 


Solution. We will prove the result by inducting on the number of steps 
required by the Euclidean algorithm to produce the greatest common 
divisor of a and b. (Another proof is outlined in 3.1.9.) 

Suppose a> b. If only one step is required, there is an integer q such 
that a = bq, and in this case gcd(a, b) = b. Also, in this case, gcd (a,b) = b 
= a + (1 — q)b, so set s = 1, t = 1 — q, and the proof is complete. 

Assume the result has been proved for all pairs of positive integers which 
require less than k steps, and assume that a and b are integers that require 
k steps, k > 1. By the division algorithm, there are integers q and r such 
that 

a = qb + r, 0 < r < b. 

The greatest common divisor of b and r can be computed by the Euclidean 
algorithm in A: — 1 steps, so by the inductive assumption, there are integers 
c and d such that 

cb + dr = gcd(6,r). 

From these last two equations, it follows that 

gcd(a,£) = gcd(h,r) 

= cb + dr 
= cb + d(a — qb) 

= da + (c — dq)b, 

and the proof is complete upon setting s = d and t — c — dq. 

The steps of this proof will be clarified by an example. 

3.13. Find integers x andy such that 

754x + 221 y = gcd(754, 221). 
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Solution. We first apply the steps of the Euclidean algorithm to find the 
greatest common divisor of 754 and 221. We find that 

754 = 3 x221 +91, 

221 = 2 X 91 + 39, 

91 = 2X39+ 13, 

39 = 3 x 13. 

This shows that gcd(754,221) = 13. 

To find the desired integers x and y, we proceed “backwards” through 
the steps of the Euclidean algorithm (this was the essence of the inductive 
proof given above): 

13 =91 - 2 X 39 

= 91 — 2(221 — 2 X 91) 

= 5X91 -2x221 
= 5(754 — 3 X 221) — 2 X 221 
= 5 X 754 - 17 x 221. 

Thus, one solution is x = 5 and y = - 17. 

The following result is often useful. 


3.1.4. The equation ax + by — c, a,b,c integers, has a solution in inte¬ 
gers x and y if and only if gcd(u, b) divides c. Moreover, if (v 0 , v p ) is an 
integer solution, then for each integer k , the values 


at' = at 0 + bk/d, 
y'=y 0 - ak/d. 


d = gcd(o, b). 


are also a solution, and all integer solutions are of this form. 


Solution. For the first part, it is clear that gcd(a,6) must divide c, since 
gcd(o,h) divides ax + by. Therefore, gcd(tf,6)|c is a necessary condition 
for the existence of a solution. On the other hand, if c is a multiple of 
gcd(d, b), say c = gcd(a, b) X q, we can find an integer solution in the 
following manner. We know there are integers s and t such that sa + tb 
= gcd(n, b). So set x = sq and y = tq. Then ax + by = asq + btq = (as + 
tb)q = gcd (a,b)q = c. 

A straightforward calculation shows that (x',y'\ as given, gives a 
solution, provided (x 0 ,y 0 ) is a solution. To show all integer solutions have 
this form we argue geometrically as follows (Figure 3.1). 

Note that the problem of solving ax + by = c in integers x and y is 
equivalent to the problem of finding the lattice points that lie on the 
straight line ax + by = c. Suppose that (,v 0 , y 0 ) is a lattice point on the line 
ax + by = c; that is, suppose that 


= c. 


ax o + by 0 
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The result is easy to prove if b = 0, so suppose here that 6 # 0. If (x\ y‘) is 
any other lattice point in the plane, then ( x ', y') will be on the line 
ax + by — c if and only if 

— = = where d=*%cd{a,by 

x - * 0 b b/d v ! 

Since a/d and b/d are relatively prime, this equation will hold if and only 
if there is an integer k such that 

y‘-y 0 = -(a/d)k, 

x' — = {bjd)k. 

It follows that all integer solutions of ax + by = c are given by the 
equations 

x' = x 0 + bk/d, 
y’^y^-akfd, 

k an integer, d = gcd(u,i>). 

3.1.5. Prove that the fraction (2In + 4)/(14n + 3) is irreducible for every 
natural number n. 

Solution. We need to prove that 14n + 3 and 21n + 4 are relatively prime 
for all n. Our preceding discussion shows that we will be done if we can 
prove that there exist integers s and t such that 

s(2ln + 4) + f(14n + 3) = 1, 


or equivalently. 


7n(3s + 21) + (4s + 3r) = 1. 
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This equation will hold for all n if we can find integers s and t which satisfy 

35 + 2t = 0 , 

4$ + 3r = 1. 

It is straightforward to see that these equations are satisfied by s = —2 and 
t ~ 3, and this completes the proof. 


3.1.6. The measure of a given angle is 180°/n, where n is a positive integer 
not divisible by 3. Prove that the angle can be trisected by Euclidean means 
(straightedge and compass). 


Solution. We do not expect this problem to have anything to do with 
numbers, and yet, what is the significance of the condition that n is not 
divisible by 3? This means 3 and n are relatively prime, so there are integers 
s and t such that 

hj + 3t =■ 1. 

We wish to construct an angle of 60°/ n. When we multiply each side of the 
last equation by 60 a / n, we get 

60°5 + (180°/n)t =■ 60°/n. 

But now observe that the left side of this equation describes how to 
construct 60°/n. This is because we can construct a 60°angle, we are given 
the angle 180°/n, the integers s and t can be found, and therefore we can 
construct 60° 5 + (180 ° jri)t. 

Problems 

3.1.7. If gcd(u,&) = 1, prove that 

(a) gcd(o - b, a + b) < 2, 

(b) gcdftf — b,a + b,ab) = 1, 

(c) gcd(a 2 — ab + b 2 , a + b) < 3. 

3 . 1 . 8 , The algebraic sum of any number of irreducible fractions whose 
denominators are relatively prime to each other cannot be an integer. That 
is, if gcd (a jt b,) — 1, / = I, . . ., n, and gcd (b it bj) = 1 for i j, show that 



is not an integer. 

3.1.9. Let S be a nonempty set of integers such that 

(i) the difference x — y is in S whenever x and y are in S , and 

(ii) all multiples of x are in S whenever x is in S. 
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(a) Prove that there is an integer d in S such that S consists of all multiples 
of d. (Hint: Consider the smallest positive integer in S.) 

(b) Show that part (a) applies to the set { ma + nb \ m and n are positive 
integers}, and show that the resulting d is gcd(a, b). 

3.1.10. 

(a) Prove that any two successive Fibonacci numbers F„,F n + 1 , n > 2, are 
relatively prime. 

(b) Given that T, = 2, and T n + ] = T„ — T n 4- 1, « > 0, prove that T n and 
T m are relatively prime whenever n=£ m. 

3.1.11. For positive integers prove there exist integers 

k lt . . . , k„ such that + * • • + k„a n = gcd(a,, . . . , a n ). 

3.1.12. Prove that (a + b)/(c + d) is irreducible if ad - be = 1. 

3.1.13. Prove that gcd(o lt . . . , a m )gcd(i> l , = gcd(a l b l ,a 2 b 1 , . .. , 

a m b n ), where the parentheses on the right include all mn products a : b^ 
i = 1, . . ., m,j = 1,. . ., n, 

3.1.14. When Mr. Smith cashed a check for x dollars and y cents, he 
received instead y dollars and x cents, and found that he had two cents 
more than twice the proper amount. For how much was the check written? 

3.1.15. Find the smallest positive integer a for which 

1001* + 770y = 1,000,000 + a 

is possible, and show that it has then 100 solutions in positive integers. 

3.1.16. A man goes to a stream with a 9-pint container and a 16-pint 
container. What should he do to get 1 pint of water in the 16-pint 
container? (Hint: Find integers s and t such that 9s + 16r = 1.) 

3.1.17. There is more than one integer greater than 1 which, when divided 
by any integer k such that 2 < k < 11, has a remainder of 1. What is the 
difference between the two smallest such integers? 

3.1.18. Let b be an integer greater than one. Prove that for every nonnega¬ 
tive integer N, there is a unique nonnegative integer n and unique integers 
a it i = 0, l,. . . , n, 0 < a, < b, such that a n ¥= 0 and 

N = a„b n + a n _ l b’’~ l + ■ • • + a 2 b l + a t b + a 0 . 

(The result is immediate for N < b, so assume N > b. Use induction.) 


Additional Examples 

3.2.4, 3.2.21, 3.3.11, 3.3.19, 3.3.28, 4.1.9, 4.2.1, 4.2.2, 4.2.4, corollary (iii) of 
Lagrange’s theorem in Section 4.4, 4.4.5, 4.4.6, 4.4.8. 
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3.2. Modular Arithmetic 


The parity of an integer tells us how that number stands relative to the 
number 2, Specifically, a number is even or odd according to whether its 
remainder when divided by 2 is zero or one respectively. This formulation 
of parity makes it natural to generalize the idea in the following manner. 
Given an integer n > 2, divide the set of integers into “congruence” 
classes according to their remainders when they are divided by n; that is to 
say, two integers are put into the same congruence class if they have the 
same remainders when they are divided by n. For example, for n = 4, the 
integers are divided into four sets identified with the possible remainders 
0,1,2,3. For an arbitrary n > 2, there will be n congruence classes, labeled 
0 , 1 , 2 , 1 . 

Two integers .* and y are said to be congruent modulo n, written 

x =y (mod n), 

if they each give the same remainder when they are divided by n (or, 
equivalently, and more conveniently in practice, if * — y is divisible by n). 
It is easy to prove that 

(i) x = x (mod n), 

(ii) x = y (mod n) implies y = .v (mod n), and 

(iii) [* s y (mod «) and y s z (mod «)] imply x = z (mod ri). 

These properties mean that congruence has the same characteristics as 
equality, and we often think of congruence as a kind of equality (in fact we 
sometimes read x = y (mod n) as “x equals y modulo n”). 


3,2.1. Prove that any subset of 55 numbers chosen from the set {1,2,3, 
4, . . ., 100} must contain two numbers differing by 9. 


Solution. There are nine congruence classes modulo 9: 0,1,2,3,4,5,6,7, 8 . 
By the (generalized) pigeonhole principle, seven numbers from the chosen 
55 are in the same congruence class (if each congruence class had six or 
less, this would account for at most 54 of the 55 elements). Let a v . . . , a 7 
be these numbers, and suppose they are labeled so that a , < a 2 < a 3 < * * * 
<a 7 . Since a i+l = a,- (mod 9), a i+ , — a. E {9,18, .. . }. We claim that 
a i+ ,-< 3=9 for some i. For if not, then for each /, a, + , - u,> 18, and this 
would mean that a 7 - a, ? 6 X 18 = 108. But this is impossible, since 
a 7 — a, < 100. Thus, two of the elements (among a,,. . ., a 7 ) differ by 9. 

The real power of congruences is a consequence of the following easily 
proved property. 
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Modular Arithmetic. If x = y (mod n) and u = v (mod n ) then 


and 


x + u = y + c (mod n), 
x ■ u =.y ■ v (mod n). 


This result allows us to perform arithmetic by working solely with the 
“remainders” modulo n. For example, since 

17 = 5 (mod 12) and 40 = 4 (mod 12), 

we know that 

17+ 40 = 5 + 4= 9 (mod 12) 

and 


17 X 40 = 5x4 = 8 (mod 12). 

Let m be a positive integer, n > 1, and let Z„ = {0,1,2, . . ., n — 1}. 
Observe that if x and y are elements of Z„, there are unique elements r, s, t 
in Z„ such that 

x — y = r (mod m), 
x + y = s (mod n), 
x ■ y = t (mod «). 

The set Z„ together with these operators of subtraction, addition, and 
multiplication is called the set of integers modulo n. In this system, 
computations are carried out as usual, except the result is always reduced 
(modulo n) to an equivalent number in the set Z„. 

3,2,2, Let N — 22 X 31 + 11 X 17 + 13 X 19. Determine (a) the parity of 
N; (b) the units digit of (V; (c) the remainder when N is divided by 7. (Of 
course, the idea is to make these determinations without actually comput¬ 
ing IV.) 

Solution. For part (a), 22 X 31 is even, since 22 is even, 11 X 17 is odd, and 
13 X 19 is odd, so the sum is even + odd + odd, and this is even. Notice 
that this reasoning is equivalent to computing modulo 2: 

22X31 + 11X17 + 13X19 = 0X1 + 1X1 + 1X1 = 1 + 1 = 0(mod2). 

For part (b), we need only keep track of the units digit: 22 X 31 has a 
units digit of 2, 11 X 17 has a units digit of 7, and 13 X 19 has a units digit 
of 7. Therefore, the units digit of N is the units digit of 2 + 7 + 7, or 6. Here 
again, this analysis is equivalent to computing N modulo 10: 

22x31 + 11 X 17+ 13 X 19 = 2 x 1 + 1 X 7 + 3 X 9 (mod 10) 

= 2 + 7 + 7 = 6 (mod 10). 
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Whereas parts (a) and (b) can be done without an awareness of modular 
arithmetic, it is not so apparent what should be done in part (c). The point 
of the example is that part (c) can be handled as a natural extension of the 
modular approach used in the previous cases. We work modulo 7: 

22 X 31 + 11 X 17+ 13 X 19s 1 X 3 + 4 X 3 + (-1) X 5 (mod 7) 

=3+5-5=3 (mod 7). 

Thus N is 3 more than a multiple of 7. (As a check: N = 1116 = 459 X 7 
+ 3.) 


3.23. What are the last two digits of 3 1234 ? 


Solution. We work modulo 100. There are many way to build up to 3 1234 . 
For example, 3 2 = 9 (mod 100), 3 4 == 8 1 (mod 100), 3 s = 81 X 81 = 61 (mod 
100), 3 10 = 9 x 61 =49 (mod 100), 3 20 = 49x49 = 1 (mod 100). Since 
1234 = 20 x 61 + 14, we have 3 1234 = (3 2<> ) 6l (3) 14 = 3 U = 3 4 3 !0 = 81 X 49 
= 69 (mod 100). The last two digits are thus seen to be 69. 


3,2.4. Show that some positive multiple of 21 has 241 as its final three 
digits. 

Solution. We must prove that there is a positive integer n such that 

21/i = 241 (mod 1000). 

Since 21 and 1000 are relatively prime, there are integers s and t such that 

21s + lOOOr = 1. 

Multiply each side of this equation by 241, and rearrange in the form 

21(241 j) -241 -241 X lOOOr. 

In congruence notation, the last equation means that 
21 X 241 j = 241 (mod 1000). 

If s is positive, we are done, for we can set n = 241s. If s is not positive, let 
n = 241 j + 1000/:, where k is an integer large enough to make n positive 
(by choosing k in the appropriate manner, we may even assume that n is 
between 0 and 1000). It follows that 

21« =21(241 j + 1000/:) = 21 X 241 s = 241 (mod 1000). 


3,2,5. Prove that for any set of n integers, there is a subset of them whose 
sum is divisible by «. 
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Solution. Let x t ,x z , . . . , jc„ denote the given integers, and let 

= *1. 

y 1 = x l + x 2 , 

7 n = + *2 + ‘ • ‘ + 

If \’j = 0 (mod n) for some /, we’re done, so suppose this is not the case. 
Then we have n numbers^,, . .. ,y n , and n — 1 congruence classes modulo 
n (namely, 1,2,.... n — 1), so by the pigeonhole principle, two of then’s 
must be congruent to one another modulo n. Suppose v, =y } (mod n), with 
i < j. Then 

*,■+!+•••+ x j=yj ~y, =0(modn), 
and the proof is complete. 

In the preceding example, we made use of the fact that n divides a if and 
only if a = 0 (mod n). By means of this correspondence, problems concern¬ 
ing divisibility can be translated directly into the language of modular 
arithmetic. 


3.2.6. Prove that if 2n + 1 and 3 n + 1 are both perfect squares, then n is 
divisible by 40. 


Solution. It is enough to show that n is divisible by both 5 and 8. This is 
equivalent to showing that nsO (mod 5) and n = 0 (mod 8). 

Consider modulo 5. The table below shows that a square number is 
either 0, 1, or - 1 modulo 5: 


x (mod 5) 

0 

I 

2 

3 

4 

x 2 (mod 5) 

0 

1 

- 1 

- 1 

I 


Thus, 2 n + 1 and 3 n + 1 must be either 0, I, or — 1 modulo 5. There are 
nine cases to consider: 2 n + 1 can be 0, 1, or - 1 modulo 5, and 3 n + 1 can 
be 0, 1, or — I. Some thought however reduces the number of cases to just 
two, as we shall see. Suppose that 2n + 1 = a (mod 5) and 3n + 1 = b 
(mod 5), a,b E {0,1,-1}. 

Case 1. a =£ b. In this case, we add the last two equations to get 
2 = a + b (mod 5). 

But this equation cannot hold for our choices of a and b, therefore this case 
can never occur. 
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Case 2. a = b. In this case, subtract the first equation from the second to 
get 


n = b — a (mod 5). 


In this case n is divisible by 5 (which is part of what we wanted to prove). 

Now consider modulo 8. In this case, the table shows that a square is 
either 0, 1, or 4 modulo 8: 


x (mod 8) 

0 

1 2 3 4 5 6 7 

x 2 (mod 8) 

0 

14 10 14 1 


Again, there are nine cases, depending on the values of In + I and 3 n + 1 
modulo 8. These nine cases can be reduced to two exactly as in the modulo 
5 case, and the argument in each case is exactly the same. We conclude 
that 8 divides n, and the proof is complete. 

In congruence arithmetic, the operations of addition, subtraction, and 
multiplication behave as in ordinary arithmetic (except everything is taken 
with respect to the modulus under consideration). What about division? 

We say that a divides b modulo n if there is an integer c such that 
a ■ c = b (mod n). If there is an integer c such that a • c = 1 (mod n), then c 
is called the (multiplicative) inverse of a , sometimes denoted by a~ l . Note 
that if a has an inverse, the equation ax = b (mod n) can be solved by 
simply multiplying each side of the equation by o' 1 ; x = a~ 'b (mod a). 

An important theoretical fact is that an integer a has a multiplicative 
inverse with respect to modulo n arithmetic if and only if a and n are 
relatively prime (see 3.2,21). 

As a special case of the result of the previous paragraph, consider the 
case in which the modulus n is a prime number, say p. In this case, each of 
1,2, — 1 is relatively prime to p, so they all have multiplicative 

inverses. In fact, the numbers Z p = {0,1,2, — 1} can be added, 

subtracted, multiplied, and divided (by nonzero elements), and they form a 
field (see Section 4.4). 


3.2.7. Prove that the expressions 

2x + 3y, 9x + 5y 

are divisible by 17 for the same set of integral values of x and y. 

Solution. It suffices to show that 

2jc + 3y = 0 (mod 17) if and only if 9* + 5y = 0 (mod 17). 
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The plan is to multiply each side of the left congruence by a suitable 
constant so as to transform it into the congruence on the right. So we ask: 
does there exist a constant c such that c(2x + 3y) = 9* + 5y (mod 17)? For 
this to be possible, it is necessary that 2c = 9 (mod 17). Since 2 is relatively 
prime to 17, it has an inverse. It turns out that 2 _l =9, and therefore, 
c = 9x9 = 81 = 13 (mod 17). Therefore, 2x + 3y = 0 (mod 17) implies 

13(2x + 3y) = 0 (mod 17), 

26x + 39y - 0 (mod 17), 

9jc + 5y = 0 (mod 17). 

Conversely, multiply each side of 9x + 5y = 0 (mod 17) by 4 to get 
2x + 3y = 0 (mod 17). 

The next example is a theoretical result which not only is interesting 
from a conceptional point of view, but also has many applications through¬ 
out mathematics. 


3.2.8 (Chinese remainder theorem). If m and n are relatively prime 
integers greater than one, and a and b are arbitrary integers, there exists an 
integer x such that 

x = a (mod m ), 
x s b (mod «). 

More generally, if . . . , m k are (pairwise) relatively prime integers 

greater than one, and a u a 2 , . . ., a k are arbitrary integers, there exists an 
integer x such that 

x = (mod m,-), i = 1,2, . . ., k. 


Solution. Consider the n numbers a,a + m,a + 2m ,. . ., a 4- (n - \)m. 
Each of these is congruent to a modulo m. Moreover, no two of them are 
congruent modulo n. For, if a + im = a +jm (mod n), 0 < i < j < n, then 
(/ — j)m = 0 (mod n). But m and n are relatively prime, so this last 
congruence can hold only if n divides i~j. However, i—j cannot be a 
multiple of n because of the restrictions on i and j. Therefore, i = j. It 
follows that the numbers a,a + m f . . . , & + (n — l)m are congruent in 
some order to the numbers 0,1,2,. . ., n — 1 modulo n. Therefore, for 
some i,a + mi = b (mod n). The proof of the first part is established upon 
setting x = a + mi. 

The more genera] statement can be proved in a similar way, using 
induction on k. (Let c = m k ^ t , and consider a,a + c,a + 2c, 

...,« + (m k — l)c, where a is chosen by the inductive hypothesis so that 
a = (mod w,), i * 1,2, . .., k — 1. Then a + ic = a, (mod m,), i = 
1, . .., k — 1, and no two of the numbers are congruent modulo m k , etc.) 
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3.2.9. Do there exist 1,000,000 consecutive integers each of which contains 
a repeated prime factor? 

Solution. Let p lt p 2 , • ■ ■ ,p 1 , 000,000 denote 1,000,000 distinct prime numbers. 
Then pf and pj are relatively prime if / ^ j, so by the Chinese remainder 
theorem, there is an integer x such that 

x = — k (mod pi J, k = 1,2,. . . , 10 6 . 

It follows that x + k is divisible by pi (i.e., x + k has a repeated prime 
factor), and the answer to the question is yes: take the consecutive integers 
r + l,x + 2, x + 3, ...,x + 1,000,000. 

3.2.10. A lattice point (x,y)GZ 2 is visible if gcd(.x, y) = 1. Prove or 
disprove: Given a positive integer n, there exists a lattice point (u,6) whose 
distance from every visible point is > m. 


Solution. We will look at a very special case first, but the pattern for the 
general case is a simple generalization which will be clear. Begin by 
choosing nine distinct primes p\,p 2 , . . . ,/v We now look for a lattice 
point (a,b) such that 

a - 1 = 0 (mod PiPiP^), 

« = 0 (mod/> 4 /> 5 /j 6 ), (1) 

a + 1 =0 (mod p 7 psp 9 ), 

and 

b + 1 =0(mod/>i/> 4 /j 7 ), 

b = 0(modp 2 p 5 p s ), (2) 

6-1=0 (mod PiPePg)- 

Geometrically, (a, b) is a point characterized by the following diagram: 

Multiple of 


PlPlPl PiPiPt pi p% p<> 

b + 1 
b 

b- 1 


PiPtPi 

Multiple of / p2PsPt 

i P]Pt,P9 


a - 1 


a 


a + 1 
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Since p\P 2 p 3 , PaP^P^ PiP&p 9 are relatively prime, the Chinese remainder 
theorem says that an integer a exists which satisfies equations (1). Similarly, 
since P\P*Pi, p 2 psp%, p 3 /> 6 /> 9 are relatively prime, an integer b exists which 
satisfies (2). By the way a and b are chosen, it is clear that the eight lattice 
points closest to ( a,b ) are invisible. Take, for instance, the point (a,b + 1), 
which has the form (k l p A p f p 6 ,k 2 p l p A p 1 ) for some integers and k 2 . Since 
p A is a common factor of the coordinates, this point is invisible. A similar 
argument applies to the other seven closest lattice points. 

The general case can be handled in exactly the same way, and we leave 
this as Problem 3.2.26. 


Problems 

3.2.11. Prove that any subset of 55 numbers chosen from the set {1,2, 
3,. . . , 100} must contain numbers differing by 10, 12, and 13, but need 
not contain a pair differing by 11. 

3.2.12. The elements of a determinant are arbitrary integers. Determine 
the probability that the value of the determinant is odd. (Hint: Work 
modulo 2.) 

3.2.13. 

(a) Determine whether the following matrix is singular or nonsingular: 

54401 57668 15982 103790 
33223 26563 23165 71489 

36799 37189 16596 46152 ‘ 

.21689 55538 79922 51237. 

(Hint: A matrix A is nonsingular if detA ^ 0. Examine the parity of the 
determinant of the given matrix; that is, compute its determinant 
modulo 2.) 

(b) Determine whether the following matrix is singular or nonsingular: 

64809 91185 42391 44350^ 

61372 26563 23165 71489 
82561 39189 16596 46152 ' 

.39177 55538 79922 51237. 


3.2.14 

(a) Show that 2 lx +1 + 1 is divisible by 3. 

(b) Prove or disprove: 2* = 2 y (mod n) if x =y (mod n). 

(c) Show that 4 3t+1 + 2 3x + l + 1 is divisible by 7. 

(d) If n > 0, prove that 12 divides n 4 - 4n 3 + bn 2 - In. 

(e) Prove that (2903)" - (803)" - (464)" + (261 f is divisible by 1897. 
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3.2.15. 

(a) Prove that no prime three more than a multiple of four is a sum of two 
squares. (Hint: Work modulo 4.) 

(b) Prove that the sequence (in base-10 notation) 

11 , 111 , 1111 , 11111 , . . . 

contains no squares. 

(c) Prove that the difference of the squares of any two odd numbers is 
exactly divisible by 8, 

(d) Prove that 2 70 + 3 70 is divisible by 13. 

(e) Prove that the sum of two odd squares cannot be a square. 

(f) Determine all integral solutions of a 2 + b 2 + c 2 = a 2 b 2 . (Hint: Analyze 

modulo 4.) ^ 1 

3.2.16. 

(a) If x 3 + y 3 = z 3 has a solution in integers x, y,z, show that one of the 
three must be a multiple of 7. 

(b) If n is a positive integer greater than 1 such that 2" + n 2 is prime, show 
that n = 3 (mod 6). 

(c) Let x be an integer one less than a multiple of 24. Prove that if a and b 
are positive integers such that ab = x, then a + b is a multiple of 24. 

(d) Prove that if n 2 + m and n 2 — m are perfect squares, then m is divisible 
by 24. 

3.2.17. Let S be a set of primes such that a,b E S (a and b need not be 
distinct) implies ab + 4 E S. Show that S must be empty. (Hint: One 
approach is to work modulo 7.) 

3.2.18. Prove that there are no integers x and y for which 

x 2 + 3xy — 2y 2 — 122. 

(Hint: Use the quadratic equation to solve for x; then look at the discrimi¬ 
nant modulo 17. Can it ever be a perfect square?) 

3.2.19. Given an integer n, show that an integer can always be found 
which contains only the digits 0 and 1 (in the base 10 notation) and which 
is divisible by n, 

3.2.20. Show that if n divides a single Fibonacci number, then it will 
divide infinitely many Fibonacci numbers. 

3.2.21. Suppose that a and n are integers, n > 1. Prove that the equation 
ax = 1 (mod n) has a solution if and only if a and n are relatively prime. 

3.2.22. Let a, b, c, d be fixed integers with d not divisible by 5. Assume that 
m is an integer for which 

am 3 + bni 2 + cm + d 
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is divisible by 5. Prove that there exists an integer n for which 

dr i 3 + cn 2 + bn + a 

is also divisible by 5. 

3.2.23. Prove that (21 n — 3)/4 and (15/t + 2)/4 cannot both be integers 
for the same positive integer n, 

3.2.24. 

(a) Do there exist n consecutive integers for which the y'th integer, 1 < j 

< n, has a divisor which does not divide any other member of the 
sequence? 

(b) Do there exist n consecutive integers for which the yth integer, 1 < j 

< n, has at least j divisors, none of which divides any other member of 
the sequence? 

3.2.25. Let . . . ,m r be positive integers which are pairwise rela¬ 

tively prime. Show that there exist r -I- 1 consecutive integers s,s + 1, 
. . . ,s + r such that divides s + / for i = 0,1, . . . , r. 

3.2.26. Complete the proof of 3.2.10. 


Additional Examples 

3.3.11, 3.4.3, 3.4.9, 4.1.3, 4.2.4, 4.2.14, 4.3.4, 4.3.5, 4.4.6, 4.4.7, 4.4.8, 4.4.9, 
4.4.19, 4.4.20, 4.4.21, 4.4.22, 4.4.23, 4.4.24, 4.4.29, 4.4.30, 4.4.31. 

3.3. Unique Factorization 


One of the most useful and far-reaching results at the heart of elementary 
number theory is the fact that every natural number greater than one can 
be factored uniquely (up to the order of the factors) into a product of prime 
numbers. More precisely, every natural number n can be represented in one 
and only one way in the form 

« = pVpr ■ ■ ’ Pk k 

where Pi, . .. ,p k are different prime numbers and a l ,a 2 , .. . ,a k are 
positive integers. Here are some easily proved, but very useful, conse¬ 
quences. 


33.1. All the divisors of 
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are of the form 

m = Pi'Pp ' ‘ ' P$i 0 i = 1 , . . . , *, 

and every such number is a divisor of n. It follows that n has exactly 
(a, + 1 \a 2 + 1) • ■ • {a k 4- 1) distinct divisors. 

3 . 3 . 2 . An integer n = p/pP • ■ • p/‘ is a perfect square if and only if a : is 
even for each i, a perfect cube if and only if each a is a multiple of three, 
and so forth. 

333 . Let a,b, .. . , g be a finite number of positive integers. Suppose their 
unique factorizations are 

« = Pi'Pi ! ■ ■ ‘ P?> b =Ptft ‘ ■ -Pk k ’ g = pW-’ - P?, 

where a u .. . , a k ,b \,.... b k , .... g ]f .... g k are nonnegative integers 
(some may be zero). Then 


gcd (a,b, . 

•->£) = 

‘Pi P2 1 ’ 

■ ■ P/\ 

lcm(a,i>, . . 

• > g) Pi ?2 



where m t = minfu,,^,, .. ., &} and M t — max{u,,. .. , g t } for each i 
- 1,2, .. \ k. From this it easily follows that 

gcd (a,b, . . . , g)lcm(cr,h, . .., g) = ab ■ ■ ■ g. 

33.4. Use unique factorization to show that i/2 is irrational. 

Solution. Suppose there are integers r and s such that i/2 = r/s. Then 
Is 2 = r 1 . But this equation cannot hold (by unique factorization), for on the 
left side, the prime 2 is raised to an odd power, and on the right side, 2 is 
raised to an even power (2 occurs an even number of times (perhaps zero) 
in s 2 and r 2 ). This contradiction implies that /2 must be irrational. 


33.5. Find the smallest positive integer n such that n/2 is a perfect square, 
n /3 is a perfect cube, and n/5 is a perfect fifth power. 


Solution. Since n is divisible by 2, 3, and 5, we may assume it has the form 
n = 2 a 3 6 5 e . Then n/2 = 2 a ~ '3*5 C , n/3 = 2“3 i - , 5 r , n /5 = 2 fl 3 , ’5 t - l . The 
conditions are such that a — 1 must be even, and a must be a multiple of 
both 3 and 5. The smallest such a is a = 15. Similarly, the smallest values 
for b and c are b = 10 and c = 6. Thus n — 2 is 3 l0 5 6 is the smallest such 
positive integer. 


33.6. Prove there is one and only one natural number n such that 
2 8 + 2 11 + 2" is a perfect square. 
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Solution. Suppose m 2 = 2 s + 2" 4- 2". Then 

2” = m 2 - 2 8 — 2 11 

= m 2 - 2\ 1 + 2 3 ) 

= m 2 -(3x2 4 ) 2 

= (m — 48)(w + 48). 

Because of unique factorization, there are non negative integers s and /such 
that 


m - 48 = 2 s , m + 48 = 2', s + r = n. 

Thus m = 2 s + 48, m = 2' — 48, so that 

2 s + 48 = 2' - 48, 

2' — 2 s = 96, 

2 s {2‘-°- 1) = 2 5 X3. 

Since 2'~ s - 1 is odd, unique factorization implies that 2'~ s - 1=3. It 
follows that j = 5, / = 7, and n = 12. 


33.7. Let n be a given positive integer. How many solutions are there in 
ordered positive-integer pairs {*, y) to the equation 


xy 

x +y 


= «? 


Solution. Write the equation in the form 

xy = n(x +y), 
xy — nx — ny = 0, 

(.* - n)(y - n) = n 1 . 

Since we want positive integer solutions, it must be the case that x > n and 
y > n (0 < x < n and 0 < y < n imply (x - n)(y - «) < n 2 ). 

Suppose the prime factorization of n is p^'p? 1 ■ ' ' p£ k - Then n 2 
= p] a 'p\ ai • • • pl a ‘. Each divisor of n 2 determines a solution, and therefore 
the number of such solutions is (2 a x + l)(2u 2 +!)•■• (2 a k + 1). 


33.8. Let r and s be positive integers. Derive a formula for the number of 
ordered quadruples («, b, c, d) of positive integers such that 

TT = lcm(n,6, c) = lcm (a,b,d) = lcm (a,c,d) = 1cm (b,c,d). 


Solution. In view of the result of 3.3.3, it is apparent that each of a, b, c, 
and d must have the form 3*7" with m in {0,1, .. ., r} and n in [0, 
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1, . . . , s}. Also, n must be r for at least two of the four numbers, and n 
must be s for at least two of the four numbers. There are fyr 2 allowable 
ways of choosing the m’s in which exactly two m’s will equal r; there are 
(t)r allowable ways in which exactly three m’s. will equal r; there are ( 4 ) 
allowable ways in which all m's equal r. Putting this together, there are 

choices of allowable m's. Similarly, there are 

allowable ri" s. The desired number is therefore (1 + 4r 4- 6 r 2 )(l + 4s + 6 s 2 ). 

33.9. Given positive integers x, y, z, prove that 

(x,yXx,z)(^, 2 )[x,y, 2 ] 2 =[x,y][x,z][>-, 2 ](x, y,zf, 

where (a, . . . , g) and [a, . . . ,g\ denote gcd(a, . . ., g) and lcm(u, .. ., g) 
respectively. 


*■ 

Solution. Because of unique factorization, it suffices to show that for each 
prime p> the power of p on the left side (in its prime factorization) is equal 
to the power of p on the right side. So suppose x = p a r, y = p b s, and 
z = p c t, for integers r,s,t, each relatively prime to p. We may assume 
(because of symmetry, and by relabeling if necessary) that a < b < c. Then 
the power of p in the unique factorization of [x, y, z] 2 is 2c; the powers of p 
in (jt, y), (jc, z), and (y, z) are a, a, and b respectively. Hence the power of p 
on the left side is 2 a + b + 2c. 

In the same manner, the power of p on the right side is b + c + c + 2a 
= la + b + 2c. Thus, by our earlier remarks, the proof is complete. 

33.10. Show that 1000! ends with 249 zeros. 


Solution. Write 1000 != 2°5*r, where r is an integer relatively prime to 10. 
Clearly, a > b, and the number of zeros at the end of 1000 ! will equal b. 
Thus, we must find b. 

Every fifth integer in the sequence 1,2,3,4,5,6,..., 1000 is divisible by 
5; there are [ 1000/5 ] = 200 multiples of 5 in the sequence. Every 25th 
integer in the sequence is divisible by 25, so each of these will contribute an 
additional factor; there are [ 1000/25 1 = 40 of these. Every 125th integer 
in the sequence is divisible by 125, and each of these will contribute an 
additional factor; there are 11000/125 ] = 8 of these. Every 625th integer 
will contribute an additional factor; there are [ 1000/625 ] = 1 of these. 
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Thus, b = [[ 1000/5 ] + [ 1000/25 fl + |[ 1000/125 ] + [ 1000/625 ]| 
= 200 + 40 + 8+1 = 249. 

In exactly the same manner, the highest power of p in m! is given by the 
(finite) sum 


[ n/p ] + [ n/p 1 J + If n/p 3 ]+•••. 


3.311. Prove that there are an infinite number of primes of the form 

6 « — 1 . 


Solution. First, notice that if p is a prime number larger than 3, then either 
p ~ 1 (mod 6) or p = — 1 (mod 6). [If p = 2 (mod 6), for example, then 
p = 6k + 2 for some k, which implies that p is even, a contradiction. A 
similar argument works for p = 3 (mod 6) or p = 4 (mod 6).] 

Now suppose there are only a finite number of primes of the form 
6 n — 1. Consider the number N = p\ — 1, where p is the largest prime of the 
form 6n — 1. Write A as a product of primes, say 

N=p\-\ =p\p 2 ■ ■■p m - (1) 

Observe that each of the primes p k is larger than p. For, if p k < p then 
equation (1) shows that p k divides 1, an impossibility. Since p is the largest 
prime congruent to - 1 modulo 6, it follows that p k = 1 (mod 6) for each k. 
If we now consider equation (1) modulo 6, we find that 

pi - 1 = 1 (mod 6), 

or equivalently, 

p\= 2 (mod 6). 

But this is clearly impossible, since p\ = 0 (mod 6). Therefore, there must be 
an infinite number of primes of the form 6m — 1. 


Problems 

33.12. In a certain college of under 5000 total enrollment, a third of the 
students were freshmen, two-sevenths were sophomores, a fifth were juniors 
and the rest seniors. The history department offered a popular course in 
which were registered a fortieth of all the freshmen in college, a sixteenth of 
all the sophomores, and a ninth of all the juniors, while the remaining third 
of the history class were all seniors. How many students were there in the 
history class? 

33.13. Find the smallest number with 28 divisors. 

33.14. Given distinct integers a,b,c,d such that 

(* - c)(jc - h)(x - c)(jc - d) - 4 = 0 
has an integral root r, show that 4 r — a + b + c + d. 
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(a) Prove that V72 is irrational. 

(b) Prove that there is no set of integers m, n, p except 0,0,0 for which 
m + ri'jl +/j^3 = 0 . 

33.16. Given positive integers a,b,c,d such that a 3 = b 1 , c 3 = d 2 , and 
c - a = 25, determine a, b, c, and d. 

33.17. Prove that if ab, ac, and be are perfect cubes for some positive 
integers a,b,c, then a, b, and c must also be perfect cubes. 

33.18. A changing room has n lockers numbered 1 to n, and all are 
locked. A tine of n attendants P,, P 2 , . .., P n file through the room in 
order. Each attendant P k changes the condition of those lockers (and only 
those) whose numbers are divisible by k: if such a locker is unlocked, P k 
will lock it; if it is locked, P k will unlock it. Which lockers are unlocked 
after all n attendants have passed through the room? What is the situation 
if each attendant performs the same operation, but they file through in 
some other order? 

33.19. The geometry of the number line makes it clear that among any set 
of n consecutive integers, one of them is divisible by n. This fact is 
frequently useful, as it is for example in the following problems. 

(a) Prove that if one of the numbers 2" — 1 and 2" + 1 is prime, n > 2, then 
the other number is composite. 

(b) What is the largest number N for which you can say that n 5 — 5n 3 4- 4 n 
is divisible by N for every integer n? 

(c) Prove that every positive integer has a multiple whose decimal represen¬ 
tation involves all ten digits. 

33.20. For each positive integer «, let H n = 1 + 1 /2 + • • • -f 1/n. Show 

that for n > 1, H„ is not an integer. (Hint: Suppose H n is an integer. 
Multiply each side of the equality by lcm(l,2, and show that the 

left side of the resulting identity is even whereas the right side is odd.) 

33.21. If gcd(n,b) = 1, then show that 

(i) gcd«n + b) m ,(a - b) m ) < 2 m , and 

(ii) gcd{a m -I- b m ,a m — b m ) < 2 . 

33.22. For positive integers a, . . . , g, let (a, . . ., g) and [a, . .. , g] de¬ 
note the gcd(a,. . ., g) and lcm(u,. . ., g) respectively. Prove that 

(a) xyz = (xy, xz, yz)[x, y, z], 

(b) (x[y, z]) = [(x, y), (x,z)\, 

(c) [x,(y,z)\ = (O, 

(d) ([*, y], [x,z],[y,z]) - [(x, y\(x,z), (y,z)\, 

(e) [Jt, y, z](x, y)(x, z)(y, z) = xyz(x> y, z), 

(0 (^/) = (^+/,K>'D- 
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33 . 23 . Let m be divisible by 1,2, .. ., n. Show that the numbers 1 4- 
m(\ + i), i = 0, 1 , 2 , . .., n, are pairwise relatively prime. 

33 . 24 . The prime factorizations of r 4- 1 positive integers (r > 1) together 
involve only r primes. Prove that there is a subset of these integers whose 
product is a perfect square. 

33 . 25 . 

(a) Determine all positive rational solutions of x y = y x . 

(b) Determine all positive rational solutions of x x+y = (x + y} v . 

3.3.26. Suppose that a 2 + b 2 = c 2 , a,b,c integers. Assume gcd(u, 6 ) 
=* gcd(a, c) = gcd (b, c) = 1. Prove that there exist integers w and v such that 
c — b = 2u 2 , c + b = 2 m 2 , gcd(w,«) = 1. Conclude that a = 2«u, b = v 1 — 
u 2 , c — v 2 + u 2 . (Hint: By examination modulo 4, it is not the case that a 
and b are both odd; neither are they both even. So without loss of 
generality, a is even and b is odd.) 

Conversely, show that if u and m are given, then the three numbers a, b, c 
given by the above formulas satisfy a 2 + b 2 — c 2 . 

33.27. Find all sets of three perfect squares in arithmetic progression. 
(Hint: Suppose a < b < c and b 2 — a 2 — c 2 - b 2 , or equivalently, a 2 + c 2 
= 2b 2 . Let s = (c + a)/2, t = (c — a)/2. Show that s 2 + t 2 — b 2 . Now apply 
the result of 3.3.26.) 

33 . 28 . 

(a) Suppose there are only a finite number of primes of the form 6n - 1; 
call them pi,-.-,p k . Reach a contradiction by considering N 

= o>, • * * pkf - 1 - 

(b) Prove that there are an infinite number of primes of the form 4/t — 1. 


Additional Examples 

1.10.9, 1.10.10, 2.6.1, 3.1.4, 3.4.8, 4.1.3, 4.2.3, 4.2.16b, 4.4.9, 5.2.1, 5.2.4, 
5.2.6, 5.2.9, 5.2.14, 5.2.15, 5.2.16, 5.2.17. 

3.4. Positional Notation 


We will assume a familiarity with the positional system of representing real 
numbers. Namely, if b is an integer greater than one (called the base), each 
real number x can be expressed (uniquely) in the positional form 
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where A 0 ,. . ., A„,a 1 ,a 2 , . .. (called the digits) are integers, 0 < A,- < b, 
0 < ctj< b, and there is no integer m such that a k — b — 1 for all k > m. 
This representation is used to denote the sum of the series 

A„b" + A n _ yb n ~ ] + • • • + /f 1 h + j4o + u ] 6 -l + a i b~ 2 + • • • . 


3.4.1. Let C denote the class of positive integers which, when written in 
base 3, do not require the digit 2. Show that no three integers in C are in 
arithmetic progression. 


Solution. Let d denote the common difference for an arbitrary arithmetic 
progression of three positive integers, and suppose that when d is written in 
base 3 notation its first nonzero digit, counting from the right, occurs in the 
/rih position. Now, let a be an arbitrary positive integer, and write it in base 
3 notation. The following table gives the &th digit of each of the integers a, 
a + d, and a + Id, depending upon the &th digit of d and a: 


]f the kth digit of d is 1 
and 

'\ kth digit of a is 

Then the \ 

kth digit of \ 

\ 0 1 2 

kth digit of d is 2 
and 

kth digit of a is 

0 ] 2 

a 

0 ] 2 

0 ] 2 

a + d 

1 2 0 

2 0 ] 

a + 2d 

2 0 1 

] 2 0 


In every case, one of a, a + d, a + 2d has a 2 in the fcth digit, which means 
the corresponding number does not belong to C. 

3.4.2. Does [*]] + [ 2xfl+[4x]| + [8*1+116* 1 + [32x ] = 12345 
have a solution? 


Solution. Suppose that x is such a number. It is an easy matter to show that 
195 < * < 196 (since 63 X 195 = 12,285 < 12,345 < 12,348 = 63 x 196). 
Now, write the fractional part of x in base-2 notation (the a,b,c, . . . are 
either 0 or 1 ): 


x = 195 + .abcdef .. . . 
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Then 

2x — 2 X 195 + a.bcdef . . . , 

Ax = 4 X 195 + ab.cdef .. ., 

8 x = 8 X 195 + abc.def. . . , 

16* = 16 X 195 + abcd.ef. , ., 

32* = 32 X 195 + abode.f .... 

In this form we see that 

lx ]« 195, 

[2* ] = 2 x 195 + a, 

[4* ] = 4 X 195 + 2a + *, 

[ 8 * ] = 8 x 195 + Aa + 2b + c, 

[16x]« 16X 195 + 8£i+4* +2c+ </, 

132* J = 32 X 195 + 16a + 8 * + Ac + 2d + e. 

Adding, we find that ff* J + [2* J + [4* ] + | 8 * ]| + [ 16* J + f32* j 
= 63 X 195 + 31a + 15* + 7c + 2d + e. The problem is therefore reduced 
to finding a,b,c,d,e, each 0 or 1, such that 31 a + 15* + 7c + 2>d + e — 60. 
But this equation cannot hold under the restrictions on a,b,c,d > e, since 
31a + 15* + 7c + 3d + e < 31 + 15 + 7 + 3 + 1 = 57 < 60. Therefore, 
there can be no such *. 

When an integer is written in decimal notation (base 10), it is possible to 
determine very easily if it is divisible by 2 or 5. There are other divisibility 
tests that are easy to apply. For example: An integer N is divisible by 4 if 
and only if its last two digits are divisible by 4. To see this, write N in base 

10 : 


N = (a n l(T + +« 2 10 2 )+(a l 10 + a 0 ) 

and note that a n l0" + ■ ■ • + ajIO 2 is always divisible by 4. Thus, 4| N if 
and only if 4 1 («, 10 + a 0 ). 

One of the most striking and useful divisibility tests is that an integer is 
divisible by 9 if and only if the sum of its digits (in decimal notation) is 
divisible by 9. To see why this is so, notice that 10= 1 (mod 9), and 
therefore, by the properties of modular arithmetic, 10 2 = 1 (mod 9), 10 3 = 1 
(mod 9), and so forth. It follows that 

N = a n lCF + * • • + a, 10 + a 0 = a n + • • • + a, + a 0 (mod 9). 

A similar proof shows that an integer is divisible by 3 if and only if the 
sum of its digits is divisible by 3. As an application of this test, suppose we 
ask: for what digits * is 4324*98765223 divisible by 3? We simply need to 
add the digits modulo 3, and choose * that will make the sum congruent to 
zero modulo 3. In this case, the sum of the digits is 1 + * modulo 3, so the 
number is divisible by 3 if and only if * = 2, 5, or 8 . 
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3 . 43 . When 4444 4444 is written in decimal notation, the sum of its digits is 
A . Let B be the sum of the digits of A . Find the sum of the digits of B. (A 
and B are written in decimal notation.) 

Solution. Let N = 4444 4444 . Then N < (lO 5 ) 4444 = 10 22120 , which means that 
when N is written in decimal notation, it will have less than 22,220 digits. 
Since each of the digits of N must be less than or equal to 9, we are certain 
that A < 22,2JK> X 9 = 199,980. 

In a similar manner, A has at most 6 digits, so that the sum of the digits 
of A must be less than 54 ( = 6 X 9); that is, B < 54. 

Of the positive integers less than 54, the number with the largest sum of 
digits is 49, and this sum equals 13. Let C denote the sum of the digits of B. 
We have just seen that C < 13. 

From our reasoning preceding the problem, we know that 

N = A = B = C (mod 9), 

so let us calculate the congruence class of C by calculating the congruence 
class of N. First, 4444 = 9 x 493 + 7, and therefore 4444 = 7(mod 9). Also, 
7 3 = 1 (mod 9). Since 4444 = 3 X 1481 + 1, we have 

44444444 = 74444 ^ mod 

= 7 3X148| x 7 (mod 9) 

= 7 (mod 9). 

Thus, C = 7 (mod 9) and C < 13. The only number which can satisfy both 
of these requirements is C = 7, and the problem is solved. 


3 . 4 . 4 . An (ordered) triple (jr,,x 2 ,jc 3 ) of positive irrational numbers with 
x | + x 2 + = 1 is called balanced if each jc ( - < ^ . If a triple is not balanced, 
say if Xj >^, one performs the following “balancing act”: 

where x' = 2x : if i ^ j and xj = 2xj — 1. If the new triple is not balanced, 
one performs the balancing act on it. Does continuation of this process 
always lead to a balanced triple after a finite number of performances of 
the balancing act? 


Solution. Write jc,,jc 2 ,x 3 in base 2 notation in the manner described at the 
beginning of the section, say 

jc, =.a,a 2 flj. . . , 

*2 =.*,* 2*3 * • • > 

X} = .CjCjCj . .., 

where a t , b i , are each 0 or 1 . 



no 
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To say that each x, <j is to say that a i , b } , and c, are each equal to 
zero. Notice that the balancing act consists of moving the “decimal” point 
one place to the right and then disregarding the integer part. Thus, for 
example, if x,,x 2 ,x 3 were not balanced, the representations (base 2 ) of 
X|,x 2 ,x 3 are given by 

X\ = ,a 2 a 2 a A • ■ ■ , 

x' 2 = ,b 2 b 2 b A .... 

*3 = ^ 2^4 - 

Many examples can be given to show that the process need not termi¬ 
nate in a balanced triple. For example, define X|,x 2 ,x 3 (using the earlier 

1 if i is a perfect square, 

0 otherwise, 

1 if i is one more than a perfect square, 

0 otherwise, 

1 if a i + b, = 0 , 

0 otherwise, 

X, =.100100001000000100 . . . , 

x 2 =.010010000100000010 _ 

x 3 =.001001110011111001 .... 

Each of x,, x 2 , and x 3 is irrational (rational numbers are those which 
correspond to periodic “decimal” representations), and their sum is 1 (since 
X] + x 2 + x 3 = \ + \ + £ + • * * =1). Repeated applications of the balanc¬ 
ing act will never transform x l ,x 2 ,x 3 into a balanced triple (because, in 
every case, one of a l ,b i ,c i is equal to 1 ). 

3 . 4.5 (Continuation of 2 . 5 . 10 ). Suppose / is a function on the positive 
integers which satisfies 

f(2k) = 2f(k)-l, 

/(2* + l)«2/(*) + l. 

Let a be an arbitrary positive integer whose binary representation is given 
by 

a = a n a n -1 • * -fljaiOo (= «„2"+ a a _ [2 fl “ 1 + ■• ■ + a { 2 + a 0 ). 


notation) by 

a ; = 

bi- 

= 

that is. 


f(a)=b a 2" + b n ^ 2 "-'+ +b l 2 + b 0 . 


Show that 
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where 


(The idea is to replace each of the 0’s in the binary sum for a with — l’s; for 
example, for n = 10 , /( 1010 2 ) = lTlT 2 (the T’s stand for — Ts) = 8 — 4 + 
2-1=5. 

Solution. We will induct on the number of digits in the binary representa¬ 
tion of a. 

The result is true for a = 1 , so suppose it holds whenever a has fewer 
than k + 1 digits. Now consider an integer a with k + 1 digits (in base 2 ), 
say 

a = <*k<*k -1 • * -Oj^Oo- 

If a Q = 0 , then a = 2 {a k a k _ , . . . /(«) = 2 f(a k ... a,) - 1 = 

2[b k 2 k ~' + ■ • • + b 2 2 + b,J - 1 - 6*2* + * * * + b 2 2 2 + 6 ? + b 0 , and the 
result holds. If a 0 = 1, then a = 2{a k a k _, ... a,) + 1, f{a) = 
2 f{a k 1 - 2 <&* 2 *-' + • ■ • +b i )+\ = b k 2 k + ■ ■ • +b t 2 + b a , 

and again the result holds. 

This is a nice application of number representations. Notice how simple 
it is to compute: /(25) = /(11001 2 ) = I ill 1 2 = 16 + 8 - 4 - 2 + 1 = 19. 

In the next example, a special number representation allows us to 
investigate and understand a set of real numbers of central importance in 
advanced analysis. 


3.4.6. Let K denote the subset of [0,1] which consists of all numbers 
having a ternary expansion 



in which a n = 0 or 2. This is called the Cantor set. Show that K is the 

complement of the union of disjoint open intervals n= 1,2,3. 

whose lengths add to 1 . 


Solution. First observe that none of the numbers in the interval /, = (^,§) 
are in K. This is because numbers in this interval have ternary representa¬ 
tions of the form 

(.la 2 a 3 a, . .. ) 3 . 

Similarly, none of the numbers in the interval I 2 = (£, \) are in K, 
because these numbers have ternary representations of the form 

(.01 a 2 a 4 a 5 . .. ) 3 - 
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0 [ 2 1 
9 9 3 


2 7 8 1 

3 9 9 


Figure 3.2. 


Also, numbers in the interval / 3 — (|,f) have ternary representations of the 
form 


(.21o 3 a 4 « 5 

so these are not in K. In the same manner, none of the intervals / 4 
= (£,£), *s = (27*2?). h = (£>&)> /? = (§3.27) contain elements in K. 

It is apparent that this process can be carried out systematically. Figure 
3.2 and Table 3.1 help make the idea precise. 

To find /„ (that is, X n and Y„) for an arbitrary positive integer n, write n 
in base 2 notation: 


(i.e., n = a, + la 2 + ■ ■ ■ + 2 k a k , a, = 0 or 1), let b t = 2 a it * = 1,2. k, 

and set I n = (X„, Y„), where 


-a n 

II 

b 2 

■ + -f + 

3 2 

... + bk ~ 1 
3*-‘ 

+ Ji - ('*1*2 ' ' 


It 

Ujl.?" 

h 

■ + -4 + 
3 2 

1 1 

+ 


• V,2) 3 . 

It is easy to see that X r 

, and Y n are 

elements of K for each n (note that 

x.-b l /l + b 2 /#+ ■■■' 

+ (**-i)/(3' 

1(2/3*n and that no el- 

ements in I„ are 

in K (the /cth digit of every element of I„ = (X„, Y n ) is 1). 

From these facts it follows that the 

5 are disjoint. 




Table 3.1. /„ 



n 

n 

x n 

Y„ 


(base 10) 

(base 2) 

(base 3) 

(base 3) /„ (in fractional form) 

1 

1 

0.1 

0.2 

(i.f) 

2 

10 

0.01 

0.02 

U > 5 ) 

3 

11 

0.21 

0.22 

(l,§) 

4 

100 

0.001 

0.002 

(A,*) 

5 

101 

0.201 

0.202 

(#,») 

6 

no 

0.021 

0.022 

(?7 >$r) 

7 

in 

0.221 

0.222 


8 

1000 

0.0001 

0.0002 

(*>*> 

9 

1001 

0.2001 

0.2002 

(».») 
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Also, the length of /„ is 1 /3*, where k = [ log 2 n ], and therefore 

- 2 m * 1 — ] 




i 


= 2 


( 3 11 loejn 11 +1 ) 


n %" 3 R lo «2' 1 1 +1 

=l. 2 1 

Our construction of the /„’s makes it clear that K is what remains after 
the intervals I n are removed from [0,1], and the result is proved. 


Problems 

3.4.7. Prove that there does not exist an integer which is doubled when the 
initial digit is transferred to the end. 

3.4.8. Find the smallest natural number n which has the following proper¬ 
ties: 

(i) its decimal representation has a 6 as its last digit, and 

(ii) if the last digit 6 is erased and placed in front of the remaining digits, 
the resulting number is four times as large as the original number n. 

3.4.9. 

(a) Solve the following equation for the positive integers x and y : 

(360 + 3x ) 2 = 492/04. 

(b) Devise a divisibility test for recognizing when a number is divisible by 
11. (Hint: 10 s -1 (mod 11).) 

(c) If 62uM27 is a multiple of 99, find a and b. 

(d) Find the probability that if the digits 0,1*2,. . ., 9 are placed in 
random order in the blank spaces of 5_383_8_2_936_5_8_203_9_3_76_, 
the resulting number will be divisible by 396. 

3.4.10. Given a two-pan balance and a system of weights of I,3,3\3 3 , 
3 4 , .. . pounds, show that one can weigh any integral number of pounds 
(weights can be put into either pan). (Hint: Show that any positive integer 
can be represented as sums and differences of powers of 3.) 

3.4.11. 

(a) Does the number 0.1234567891011121314 .... which is obtained by 
writing successively all the integers, represent a rational number? 

(b) Does the number 0.011010100010100 . . ., where a„ = 1 if n is prime, 0 
otherwise, represent a rational number? 

3.4.12. Let S = a 0 a,a z . .., where a„ = 0 if there are an even number of 
l’s in the expression of n in base 2 and a H = 1 if there are an odd number of 
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l’s. Thus, S = 01101001100 .... Define T= b^b^ . . . , where b, is the 
number of l’s between the /th and the (i + l)st occurrence of 0 in S. Thus, 
T = 2102012 .... Prove that T contains only three symbols 0,1,2. 

3.4.13. Show there is a one-to-one correspondence between the points of 
the closed interval [0,1] and the points of the open interval (0,1). Give an 
explicit description of such a correspondence. 


Additional Examples 

1.1.1 (Solution 5), 4.4.8, 5.2.5, 6.1.1, 6.1.4, 6.1.8, 6.2.13, 7.6.6. 

3.5. Arithmetic of Complex Numbers 


Recall that a complex number z can be written in several different forms: 
rectangular form: z — a + bi, 

polar form: z = /-(cos 0 + i sin 6 ), 

exponential form: z = re' 6 , 

where a, b, r , and 6 are related as in Figure 3.3, and e‘ 9 = cos 0 + i sin 0. 
The angle 9 is the argument of z (determined only up to a multiple of 2tt), 
and r is the magnitude (absolute value) of z; these are denoted by argz and 
jz| respectively. The numbers a and b are called the real part and the 
imaginary part of z respectively, and are denoted by Re(z) and Im(z). 

If z = a + bi and w = c + di, then z + w = (a + c) + i(b + d) corre¬ 
sponds geometrically to the diagonal of the parallelogram having z and w 
as adjacent sides (see Figure 3.4). 

If z = re' 9 and h> = se"*, then zw = rse i(6+,f) . Notice that |zw| = rs 
= |z| |>v| and argzvv = 0 + <p = argz + argw; that is, under multiplication, 
the absolute values multiply and the arguments add. 
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3.5.1. If a, b, and n are positive integers, prove there exist integers x and y 
such that 


( a 2 + b 2 ) = x 2 + y 2 . 


Solution. Let z = a 4- bi. Then {a 2 + b 2 )” = (|z| 2 )" = |z| 2n = (|z|") 2 . But z" 
= x + iy for some integers x and y (because a and b are integers), so 
(|z"|) 2 = \x + iy\ 2 = jc 2 4- v 2 , and the proof is complete. 


3.5.2. Let n be an integer > 3, and let a, fi, y be complex numbers such 
that a” = fi" = y" = 1, a 4- fi + y = 0. Show that n is a multiple of 3. 

Solution. We may assume without loss of generality that a — I (for if not, 
divide each side of a + fi 4- y = 0 by a to get 14- fi/ a 4- y/a = 0, and 
then set a, = 1, /?, = fi/a, y, = y/a). We will assume that 0 < arg fi 
< argy < 27 t. 

Now, fi and y are of magnitude 1 (since fi" = y" = 1), so they lie on the 
unit circle (center (0,0), radius 1). From the equation fi + y = — 1, we can 
equate imaginary parts to see that Im(/I 4-y) = Im(/I) 4-Im(y) = 0, or 
equivalently, Im( fi) *= — Im(y) (Figure 3.5). Equating real parts yields 



Figure 3.5. 
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Re( j8) + Re(y) = — 1. Since we also have established that | /31 = |y| = 1, it 
must be the case that Re( (i) = Re(y) = - j, and therefore j8 = e 2l "- n , and 
y = e 4m /\ The fact that ji n = 1 implies that e 2 ™^ = 1, and this can 
happen only if a is a multiple of 3. 

The following result is very useful; it can be proved by induction. 

De Moivre’s Theorem, For each integer n, 

(cos# + i sin# )" = cos n# + i sin a#. 

(In exponential notation, (e‘ e ) n = e‘ n9 .) 

3.53. Express cos5# in terms of cos#. 

Solution. An efficient way to do this is to recognize that cos 5# is the real 
part of e 5ie . Then we can apply De Moivre’s theorem: 

cos 5# + i sin 5# = (cos 6 + i sin 6 ) 5 

= cos 5 # + 5 cos'!# (< sin 9 ) + 10 cos 3 # (rsin 2 #) 

4- lOcos 2 # (f' 3 sin 3 #) + 5 cos#(/ 4 sin 4 #) + rsin 5 # 

= (cos 5 # - 10cos 3 #sin 2 # + 5cos#sin 4 #) 

+ / (sin 5 # — 10sin 3 #cos 2 # + 5sin#cos 4 #)., 

Equating real and imaginary parts, we get 

cos 5# = cos 5 # - 10 cos 3 # sin 2 # + 5 cos # sin 4 #, 
sin5# = sin 5 # - 10sin 3 #cos 2 # + 5sin#cos'!#. 

For the case of cos5#, 

cos5# = cos 5 # — 10cos 3 #(l — cos 2 #) 4- 5cos#(l - cos 2 #) 2 
= 16cos 5 # - 20cos 3 # 4- 5cos#. 

3.5.4. Find constants a 0 ,a ls . . ., a 6 so that 

cos 6 # = o e cos6# 4- o s cos5# 4- - - • 4- £j,cos# + a 0 , 

Solution. As in the last problem, we can do this very nicely by exploiting 
the relationship between trigonometric functions (especially the sine and 
cosine) and complex variables. In this case, write cos# in the form 
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and apply the binomial theorem to get 

-p[( e '')‘ +6 ( e ") ! ( e_ ') + >v")V _, ') ! 

■ +20(e ie f(e- w f + 15(e , ' e ) 2 (e-'' 9 ) 4 + 6(e' (, )(e- /e ) 5 + (e-' <1 ) 6 ] 
= J- [(e 6 * + e ” 6,e ) + 6(e 4 * + e~ 4ie ) + I5(e w + e~ 2i9 ) + 20] 

= i [2cos60 + 2 X 6cos40 + 2 X 15cos2# + 20] 

= ^ [cos 60 + 6cos4f? + 15 cos20 + 10]. 

3.5.5. Let G n = x"sin/r4 4- y n $innB + z"sinrtC, where x, y,z,A,B, C are 
real and A + B + C is an integral multiple of ir. Prove that if G, = G 2 = 0, 
then G„ = 0 for all positive integral «. 

Solution. A standard trick (similar to 3.5.3) is to recognize that G„ is the 
imaginary part of the expression 

H„ = x n e inA + y n e ,nB + z R e ,nC . 

Suppose that H n is real for n = 0,1, .. ., k, and consider H k + r We have 

H i H k = H k + ] + H, 

where 

H — xe‘ A y k e‘ kB + xe ,A z k e' kc + ye ,B x k e ikA 
+ye' B z k e ,kc + ze‘ c x k e' kA + ze' c y k e‘ kB 
= xye i{A + a \y k -'e Kk ~' i>B + x k ~ 'e i(k ~ IM ] 

+ xze i{A + c >[7 fr - 1 e' <i “ l>c + x k ~ 'e ,yk ~ |M ] 
+yze iiB+C) [y k ~'e i(k ~' }B + z k ~ 'e i(k ~ l)C ] 

= xye-< A + B )[H k _ t - z k -'e i<k ~'> c ] 

+ xze i < A + c )[H k _ l -y k -'e t < k ~" B ] 

+yze* B + c )[fi k l - 

= H k . l [xye i ^ A + B ^ + xze BA+C) + yze* B+C) ] - xyze BA + B + c >H k _ 2 
= H k _ l K~xyze i < AABAC ^H k _ 2 , 
where K - xye t(A + B) + xze ilA + C} +yze i(B+C} , 
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Observe that H 2 = Hf + 2 K, and since H { and H 2 are real, by hypothe¬ 
sis, it must be the case that K is real. Also, by the inductive assumption, 
H k __ l and H k _ 2 are real. Because A + B + C is a multiple of tt, e ,(A + B+C) 
is real. Putting these facts together, the formula of the last paragraph shows 
that H is real. Now since H k is real, by the inductive assumption, and since 
H k + l = H } H k — H, it follows that H k + , is real. Thus, the result of the 
problem follows by mathematical induction. 


Problems 

3.5.6. 

(a) Given that 13 = 2 2 + 3 2 and 74 = 5 2 + 7 2 , express 13 X 74 = 962 as a 
sum of two squares. (Hint: Let z = 2 + 3/, w = 5 + 7i, and use |z| 2 |w| 2 

= M 2 *) 

(b) Show that 4 arc tan ^ - arc tan ^ (Hint: Consider (5 - i) 4 (l + /)•) 

3.5.7. Suppose A is a complex number and n is a positive integer such that 
A " = 1 and (A + 1)” = 1. Prove that n is divisible by 6 and that A 3 = 1. 

3.5.8. Show that 


and 


(Hint: Consider (1 + /)".) 

3.5.9, By considering possible magnitudes and arguments, 

(a) find all values of V~~i ; 

(b) find which values of (3 - 4i)~ 3/8 lie closest to the imaginary axis. 

3.5.10. 


(a) Prove that if x — x “ 1 = 2i sin# then x n — x ” = 2/sin nd. 

(b) Using part (a), express sin 2 "# as a sum of sines whose angles are 
multiples of 6. 

3.5.11. Show that 

(")ta n «-(”)unV+ - 


tann# 
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3.5.12. 

(a) Prove that 


, n n+i/«\ cosfcfl _ f 0 if n is odd, 

U'J cos *# [ ( — l) l + " /2 tan"^ if n is even 

(Hint: Consider i tan# = — 1 4- (cos# + /sin#)/cos#.) 

(b) Prove that 

Jl, k tt\ k* „ [( — l) lB+l> ' /2 sin' I #sinn# if n is odd, 

2 (-1) (?)cos a #cosA:#= v } 

*=o f — n l+n/ W#cosn# if«is even. 


<£=o XK! ^( — l) "'sin' 1 # cosh# if n is even. 

(Hint: Consider - 1 + cos#[cos# + isin#] = i sin#[cos 0 + i sin#].) 


3.5.13. Prove that 




3.5.14. Show that if e‘ e satisfies the equation z n + a n _ ] z"~' + ■■■ + 
a l z + a 0 = 0 , where the a i are real, then a„_ ,sin# + a rt _ 2 sin2# + * * * 
4- tfjStnfn — 1)# + a^in/t# = 0. 


Additional Examples 

1.3.2, 4.2.10, 4.2.11, 4.2.13, 4.2.15, 4.2.17, 4.2.20, 4.2.22, 4.3.18, 5.2.2, 5.2.3, 
5.2.11, 5.3.4, 5.3.10, 5.4.11, 5.4.28, 5.4.29. Also, see Section 8.4 (Complex 
Numbers in Geometry). 



Chapter 4. Algebra 


Algebra is one of the oldest branches of mathematics, and it continues to 
be one of the most active areas of mathematical research. The subject is 
still rich in new ideas, and it shows no signs of soon becoming exhausted or 
barren. 

In high-school algebra one learns to manipulate equations and formulas 
into equivalent forms which are more understandable and interpretable. A 
large proportion of the problems in this book attest to the usefulness of this 
basic subject. One of the most important algebraic manipulations involves 
factorization of algebraic expressions. In the first section we will look at 
problems whose solution depends upon knowing some elementary factor¬ 
ization formulas. 

The middle two sections are devoted to problems from classical algebra: 
namely, the study of polynomials. Much of this material once belonged to a 
branch of algebra called the theory of equations. The rudiments of this 
subject are now scattered throughout the high-school and college curricu¬ 
lum. In these sections we draw together the ideas of this subject that 
constitute essential knowledge for problem solving. 

In the final section we introduce those topics which professional mathe¬ 
maticians think of when they speak of algebra. Here the emphasis' is on 
formal systems and formal thinking. The subject contains a whole new 
world of concepts which generalize the classical ideas and methods. We 
introduce the most fundamental structures that make up the subject mat¬ 
ter: groups, rings, and fields. 


1 
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4.1. Algebraic Identities 


In this section we will look at applications of some of the most basic 
factorization formulas, which include the following: 

a 2 - b 2 = (a - b)(a + b), 

a 2 + lab + b 2 = (a + b) 2 , 

a 2 + b 2 + c 2 + lab + lac + Ibc = (a + b + cf, 

a’'-b" = (a-b)(a'- l + a''- 2 b + ■■■ + ab- 2 +b’- 1 ). 

If n is an odd positive integer, we can replace b by — b in the last formula 
and get a formula for the factorization of the sum of two perfect nth 
powers: 

a n + b" = (a + b)(a"-' - a”~ 2 b + ■ ■ ■ - ab"~ 2 + 6"" ! ), n odd. 


4.1.1. Show that n 4 — 20n 2 + 4 is composite when n is any integer. 


Solution, The idea is to try to factor the expression. If we proceed 
m 4 — 20m 2 + 4 = (m 4 -- 20m 2 + 100) — 96 = (n 2 — 10) 2 — 96, we are stymied 
because 96 is not a perfect square. It does work, however, to argue that 
m 4 — 20m 2 + 4 = (m 4 — 4m 2 + 4) — 16m 2 = (n 2 — 2) 2 — (4m) 2 = (n 2 — 2 — 4m) 
X (m 2 — 2 + 4m). If we can show that neither of these factors equals + 1, we 
are done. 

Suppose m 2 — 2 — 4n = 1; or equivalently, m 2 — 4m — 3 = 0. By the qua¬ 
dratic formula, m = 2±/7, and this is not an integer. Thus, if m is an 
integer, « 2 - 2 - 4m is not equal to 1. A similar argument works for the 
other three cases. 

4.1.2. Determine all solutions in real numbers x, y, z, w of the system 

x + y + z = w, 

I+I+l-l. 

x y z w 


Solution. Some initial guesses lead us to suspect solutions only when one of 
x, y, z is equal to w and the other two are negatives of one another (for 
example, x ■= w, y = ~ z). Certainly, these are solutions, but how can we 
prove there are no others? 

From the second equation, 

yZ + XZ + xy _ 1 
xyz w ’ 
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and this, together with the first equation yields 

(a + y + z)(yz + xz + xy) = xyz. 

This expands to 

x^y + x 2 z + y 2 x + y 2 z + z 2 x + z^y + 2 xyz = 0, 
which in turn factors into 

(*+y)(* + z)(.y+z) = 0. 

Our initial conjecture follows (i.e. one of x + y,x + z, y + z equals zero, 
say y = — z, and thus jc = *+ j + z = w). 

4.13. 

(a) Find all pairs ( m,n ) of positive integers such that |3 m — 2"| = 1, 

(b) Find all pairs ( m,ti ) of integers larger than 1 such that \p m — q n \ = 1, 
where p and q are primes. 

Solution, (a) When m = 1 or 2 we quickly find the solutions 
(«,«)-(!,1),(1,2), (2,3). 

We will show there are no others. 

Suppose that ( m,n ) is a solution of |3 m - 2"| = 1, where m >2 (and 
hence n > 3). Then 3 m - 2" = 1, or 3 ffl - 2" = - 1. 

Case l. Suppose 3 m — 2" = — 1, « > 3. Then 3 m = — 1 (mod 8). But this 
congruence cannot hold, since 3 m = 1 or 3 (mod 8), depending upon 
whether m is even or odd (3 == 3 (mod 8), 3 2 = 1 (mod 8), 3 3 = 3 (mod 8), 
3 4 = 1 (mod 8),. . . ). 

Case 2. Suppose 3 m - 2" = 1, n > 3. Then 3 m = 1 (mod 8), so m is even, 
say m = 2k, k> l. Then 2" = 3 2k - 1 = (3* - 1X3* + !)• By unique factor¬ 
ization, 3* + 1 = 2 r for some r, r > 3. But, by case 1, we know this cannot 
happen. This completes the proof of part (a). 

(b) ft is immediate that not both p and q are odd, for this would imply 
that p m — q" is even. So suppose that q = 2. We will show, by using only 
the algebraic identities of this section, that the only solution is that found in 
part (a), namely |3 2 — 2 3 | = 1. 

Suppose m and n are larger than 1, and that \p m - 2"| = 1. It cannot be 
the case that m and « are both even, for if m — 2r and n = 2s, then 

1 = \p m ~ 2"| - | p 2r - 2 2l | = \p' - 2 J | | p' + 2% 

and this is impossible (since p r + 2 s > 1). 

Suppose that m is odd. Then 

2"-p m ±\-(p± \)(p m ~' + p m ~ 2 + ■■ ■ -p+ 1), 

and this is impossible, since the last factor on the right side of the equation 
is an odd number larger than 1. 

Therefore, it must be the case that m is even and n is odd. 
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Suppose m = 2 r k, where k is odd and suppose k > 1. Then 

2" = p m ± 1 = (p*) k ± 1 = (p* ± 1)((^)*~ 1 ± ■ • • - ( p r ) + l), 

and again the factor on the right is odd, a contradiction. 

Therefore, m = 2 r for some positive integer r and n is odd, and our 
equation has the form | p 2 ' - 2"| = 1. Either p r — 2" = 1 or p Y — 2 n — — 1. 
Case 1. If p 1 ' — 2" = — 1, then 

p 2 ' = 2 n - 1 = (2 - 1)(2" _1 + 2" _2 + • • ■ +2 + 1) = 3 (mod 4), 

but this is impossible, since for any odd integer jc, jc* 2 = 1 (mod 4). 

Case 2. If/>*-2" « I, then 2" =/' - 1 = (/> 2 '" - \){p 2 "' + 1). The 
only way both p 1 ' — 1 and p r + 1 could be powers of 2 is for p 2 ' ’ — 1 
= 2 and p T ~' +1=4. Adding these yields p 2 ' ' = 3, and this implies that 
p = 3, r = 1, m = 2, and n = 3. This completes the proof, 

4.1.4. Prove that there are no prime numbers in the infinite sequence of 
integers 

10001,100010001,1000100010001,... . 

Solution. The terms of the sequence can be written as 

1 + 10 4 ,1 + 10 4 + 10 8 ,. .. , I + 10 4 + * • • + 10 4 ",.... 

Consider, more generally, then, the sequence 

1 + X 4 , 1 + Jt 4 + X s , _1 + x 4 + * • • + x 4 *, . . . 

for an arbitrary integer x, x > 1. 

If n is odd, say n = 2m + 1, . 

1 + x 4 + x 8 + • * * + x M2m + t '> 

= (1 + x 4 )(l + x 8 + - • • + x 8m ). 

Thus, if m > 0, the number is composite. For m = 0 and x = 10, we also get 
a composite number, since 10001 = 73 X 137, 

Suppose n is even, say n = 2m. Then 

1 + x 4 + • • • + x* 2m) = -- 1 } - - 

1 -x 4 

_ ( l-(x 2m+l ) \( 1 +(x 2m+1 ) 2 
= \ l-x 2 1 + X 2 

-(l + ^+... + (**)") 

This factorization shows the number is composite. 
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Problems 

4.1.5. 

(a) If a and b are consecutive integers, show that a 2 + b 2 + (ab) 1 is a 
perfect square. 

(b) If 2 a is the harmonic mean of b and c (i.e., 2a = 2/(1 jb + 1/c)), show 
that the sum of the squares of the three numbers a, b, and c is the 
square of a rational number. 

(c) If N differs from two successive squares between which it lies by x and 
y respectively, prove that jV — xy is a square. 


4.1.6. Prove that there are infinitely many natural numbers a with the 
following property: The number n A + a is not prime for any natural 
number n. 

4.1.7. Supposing that an integer n is the sum of two triangular numbers, 

« = ° 2 + a a- bl + b 
2 2 ’ 

write 4« + 1 as the sum of two squares, An + 1 = x 2 + y 2 , and show how x 
and y can be expressed in terms of a and b. 

Show that, conversely, if An + 1 = x 2 +y 2 , then n is the sum of two 
triangular numbers. 

4.1.8. Let N be the number which when expressed in decimal notation 
consists of 91 ones: 


IV - 111 ... 1 . 

91 

Show that N is a composite number. 

4.1.9. Prove that any two numbers of the following sequence are relatively 
prime: 

2+ 1,2 J + 1,2*+ 1,2 s + 1, . . . ,2 r + 1, . . . . 

Show that this result proves that there are an infinite number of primes. 

4.1.10. Determine all triplets of integers (jc, y, z) satisfying the equation 

* 3 + / + z 3 = (*+ / + z ) 3 . 


Additional Examples 

1.8.4, 1.12.7, 3.3.6, 4.2.5, 5.2.15, 5.3.7, 7.1.11. Also, see Section 5.2 
(Geometric Series). 
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4.2. Unique Factorization of Polynomials 


A polynomial of degree n (n a nonnegative integer) in the variable x is an 
expression of the form 

a n x " + tJ„_ 1 x n “ 1 + • • • + + a 0 , 

where a 0 ,a Y ,. . ., a n are constants (called the coefficients), and a n ¥= 0. A 
polynomial all of whose coefficients are zero is called the zero polynomial ; 
no degree is assigned to the zero polynomial. The coefficient a n is called the 
leading coefficient; if it is equal to 1 we say the polynomial is a monic 
polynomial. Two polynomials are called (identically) equal if their coeffi¬ 
cients are equal term for term, that is, their coefficients for the same power 
of the variable are equal. 

If the coefficients of the polynomial P(x) are integers, we say that P(x) 
is a polynomial over the integers; similarly if the coefficients are rationals, 
we say the polynomial is over the rationals, and so forth. 

In many respects polynomials are like integers. They can be added, 
subtracted, and multiplied; however, just as in the case of integers, when a 
polynomial divides another the result will be a quotient polynomial plus a 
remainder polynomial (more on this later). A polynomial F divides a 
polynomial G (exactly) if there is a polynomial Q such that G = QF (that 
is, G is a multiple of F). A polynomial H is a greatest common divisor of 
polynomials F and G if and only if (1) H divides F and G and (2) if K is 
any other polynomial that divides F and G, then K divides H. It can be 
shown that H is unique up to a constant multiple. 

Also, as in the case of integers, there is a division algorithm. 

Division Algorithm for Polynomials. If F{x) and <?(x) are polynomials over a field 
K ( for example, K might be the rationals, the reals, the complexes , the integers 
modulo p for p prime), there exist unique polynomials Q(x) and i?(x) over the field 
K such that 

F(x) = Q(x)G(x) + R(x), 

where R(x) = 0 or deg/t(x) < deg<?(x) (deg denotes degree ). 

Moreover, if K is an integral domain (such as the integers), the same result 
holds provided G(x) is a monic polynomial. 

As an example of the division algorithm for polynomials, let F(x) = 3x J + 
lx 1 - 5 and G(x ) = 2x 3 + 6x + 3. Then 

’ j v i_» 

j 

2x 3 + 6x + 1 )3x 5 +2x 2 ^5 

/ 3x 5 +9x 3 +lx 2 

- 9x 3 + j-x 2 - 5 

- 9 jc 3 -21x~ | 

\x 2 +21x-{ 



126 


4. Algebra 


In this case, Q(x) = \x 2 — f, and i?(x) = ±x 2 + 21 x — 4• (This example 
should make it clear that the algorithm will work in the general case only if 
the coefficients come from a field; however, it also should be clear that an 
integral domain is sufficient if the divisor is monic.) 

As in the case of the integers, the division algorithm can be used to find 
the greatest common divisor of two polynomials. Furthermore, as in the 
case of the integers, if F and G are polynomials (over a field K), there are 
polynomials S and T (over K ) such that 

gcd(F, G) = SF+ TG, 

where gcd(F, G) denotes the greatest common divisor of F and G. 

4.2.1. Find a polynomial P(x) such that P(x) is divisible by x 2 + 1 and 
F(x) + 1 is divisible by x 3 + x 2 + 1. 

Solution. By the conditions of the problem, there are polynomials S(x) and 
T(x) such that 

F(x) = (x 2 + l)S(x), 

P(x) + 1 = (x 3 + x 2 + l)T(x). 

It follows that (x 2 + l)£(x) = (x 3 + x 2 + lJTfx) — 1, or equivalently 

(x 3 + x 2 + l)r(x) - (x 2 + l)S(x) = 1. 

By our remarks preceding the example, x 3 + x 2 + 1 and x 2 + 1 are “rela¬ 
tively prime” and we can use the Euclidean algorithm for polynomials to 
find S'(x) and 7\x). Thus, we have 

+ x 2 + 1 = (x + 1 )(x 2 + l) + (- x), 

x 2 + 1 = — x( — x) + 1, 

and “working backwards,” we have 

1 = (X 2 + 1) + x (— x) 

= (x 2 + 1) + x[(x 3 + x 2 + 1) - (X + l)(x 2 + 1)] 

= (x 2 -I- 1)[ 1 - x(x + 1)] + x[x 3 + X 2 + 1 ] 

= (x 3 + X 2 + l)x - (X 2 + l)(x 2 + X - 1). 

In this form, we find that we can take £(x) = x 2 + x — 1 and T(x) = x. It 
follows that 

F(x) = (x 2 + l)(x 2 + x - 1). 


4.2.2. Prove that the fraction (n 3 + 2n)/(« 4 +3 n 2 + 1) is irreducible for 
every natural number n. 
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Solution. We have 


n 4 + 3n 2 + 1 = n(n 3 + 2n) + (m 2 4- 1), 

M 3 + 2m = M(M 2 + 1) + M, 

M 2 + 1 = n(M) + 1, 

«-"(!)■ 

It follows that gcd(M 4 + 3n 2 + 1,« 3 4- 2 m) = 1, and the proof is complete. 

Let F(x) be a polynomial over an integral domain D, and consider the 
polynomial equation F(x) = 0. If an element a of D is such that F(a) = 0, 
we say that a is a root of F(x) = 0, or that a is a zero of F(x). The following 
very useful theorem is an easy application of the division algorithm. 

Factor Theorem, If F(x) is a polynomial over an integral domain D, an element a 
of D is a root of F(x) = 0 if and only if x — a is a factor of F(x). 

By repeated application of the factor theorem, we can prove that there is a 
unique nonnegative integer m and a unique polynomial G(x) over D such 
that 

f(x) = (x — o) m G(x), 

where G(a) ¥= 0. In this case, we say that a is a zero of multiplicity m. 

The next two examples illustrate the use of the factor theorem. 

4.23. Given the polynomial F(x) = x n + a n _ 1 x"~ 1 + ■ • ■ + a Y x + a 0 
with integral coefficients a 0 ,a h .. ., a„_ 1 , and given also that there exist 
four distinct integers a,b,c,d such that F(a) = F(b) = F(c) = F{d) = 5, 
show that there is no integer k such that F(k) = 8. 

Solution. Let G(x) = F{x) — 5. By the factor theorem, x - a, x - b, x — c, 
and x — d are factors of G(x), and we may write 

G(x) = (x - «){x - h)(x - c)(x — d)H(x), 

where H(x) is a polynomial with integer coefficients. If A: is an integer such 
that F(k) = 8, then G{k) - F{k) - 5 = 8- 5 = 3, or equivalently, 

(A — «)(A — h)(A — c)(A — d)H(k) = 3. 

The left side represents a product of five integers, and each of the integers 
k — a, k — b,k — c,k — d must be distinct, since a, b, c, d are distinct. But 
this is impossible, since at most one of the numbers k — a,k — b,k ~ c, 
k — d can equal ±3, so the other three must be ± 1. Thus, such a A cannot 
be found. 



128 


4. Algebra 


4.2.4. Prove that if F(x) is a polynomial with integral coefficients, and 
there exists an integer k such that none of the integers F(l), F(2), . .., F(k) 
is divisible by k, then F(x) has no integral zero. 


Solution. It is equivalent to prove that if F(x) has an integral zero, say r, 
then for any positive integer k, at least one of F(l), F(2), . . . , F(k) is 
divisible by k. So suppose F{r) = 0. By the factor theorem, we can write 

F(x) ~(x- r)G(x), 

where G(x) is a polynomial with integer coefficients. From the division 
algorithm for integers, there are integers ^ t and s such that r = qk + s, 
0 < s < k (note the inequalities on s ). Substituting s = r — qk into the 
equation above, we get 

F(s) = {s - r)G(s) = - qkG(s). 

This equation shows that F(s) is divisible by k (G(s) is an integer), and this 
completes the proof. 

A simpler approach for this problem, based on modular arithmetic, is to 
observe that if a = b (mod k) then F(a) = F(b) (mod k). The result follows 
directly from the fact that for any given integer a, F{a) is congruent to one 
of F(l), .. . , F(k ) modulo k, and by assumption, none of these is divisible 
by k. 

The unique-factorization theorem for integers states that every integer 
can be written uniquely as a product of primes. There is a similar theorem 
for polynomials: every polynomial over a field can be written uniquely as a 
product of irreducible polynomials (i.e., prime factors). In the case of the 
complex numbers, the irreducible factors are the first-degree (linear) poly¬ 
nomials. In the case of the real numbers, the irreducible polynomials are 
the linear polynomials and the quadratic polynomials with negative dis¬ 
criminant (that is, those of the form ax 2 + bx + c, where b 2 — 4ac < 0). 

As in the case of integers, unique factorization is often a useful way of 
representing a polynomial. The next two examples illustrate the idea. 

4.2.5. Prove that every polynomial over the complex numbers has a 
nonzero polynomial multiple whose exponents are all divisible by 
1,000,000. 

Solution. Let the given polynomial be represented by the unique factoriza¬ 
tion 

P(x) = A {x - J 1 ) m '(x - j 2 ) mi ■ ■ (x- s k ) m \ 

where A is a constant, a.. s k are the roots of P(x) of multiplicities 

m 1 , . .., m k respectively. For any positive integer a (e.g., a = 1,000,000), 
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( x a — s°)/(x — s,) is a polynomial over the complex numbers (see Section 
4.1). Set 


_ a ( x a - sf {x a -s k \ m “ 

(JC) “ X ( *- J i ) ( ) 


Then Q(x) is a polynomial over the complex numbers, and 
/>(*) Q(x ) = A (x - J 1 ) m ‘ • (x- s k ) mt 




* 0 " 


(* - **) 




)) 


= ^Jc 0 (x a - • (JC° - j“) m ‘ 

is a polynomial all of whose exponents are divisible by a. 


4.2,6, Let / be a polynomial with real coefficients. Show that all the zeros 
of / are real if and only if f 2 cannot be written as the sum of squares 

f = g 2 + h 2 

where g and A are polynomials with real coefficients and deg g ^ deg A. 


Solution. Suppose f 2 = g 2 + A 3 , where g and A are polynomials with real 
coefficients, deg g deg A, and suppose that all the zeros of f are real. 
Write f in factored form: 

f(x) = A(x - a,)"' ■ ■ - (x - a k f l , 

where A is a nonzero real number. 

From the equation 

A\x^a l f m ' ■ ■ ■ {x-a k f mt = (g(*)) 2 + (A(*)) 3 
it follows that for each i = 1,2,. . ., k, 

0=U(fl,)) 2 +(*K)) 3 . 

Since g(a i ) and A(a ; ) are both real numbers, it must be the case that 
= 0 and A (a,) = 0. In fact, it follows that the multiplicity of these zeros 
is at least m r Thus, the factor theorem implies that there will be polynomi- 
als £,(.*) and A ( (x) with real coefficients such that g(x) = /(*)#,(*) and 
h{x)-= f{x)h x {x). It follows that 

1= Ui(*)) I+ (*i(*)) 2 - 

But this equation is impossible, because deg ¥= degA ( (that is, not both 
of gi and A[ are constants). This contradiction implies that / must have a 
zero that is not a real number. 



130 


4. Algebra 


Now suppose that not all the zeros of / are real numbers, and write / in 
factored form: 

f(x) = A(x- a,)"" ■■■(x- a,) m '(x 2 + 6,x + c ( )"' ■ • • (x 2 + b s x + c s )"\ 

where A is a real number, m,, . .. , m, are nonnegative integers, j is a 
positive integer and n s are positive integers, a^,b j7 Cj are real num¬ 

bers, and bj - 4c 7 < 0 fory = 1, . . ., j. We have 

x 1 + b jX + cj = (x 2 + bjx + \bf) + {cj -\bf) 

= (x + {bjf+(^Cj-bf} , 

which shows that each quadratic factor of / is a sum of squares. Replace 
each quadratic factor in the unique factorization of f 2 by its representation 
as a sum of squares. This yields an equation of the form 

/ 2 (x) = A 2 (x - a l ) lm ' ■ (x - a r ) 2m ' 

* (£i(*) + aft*))"' • • • (g?(*) + (*))"'- 

where g 1 ,. . ., g Si b l , .. ., h s are polynomials, deg g t = 1, and deg h i = 0. 

The result now follows by repeated use of the fact that the product of a 
sum of two squares with another sum of two squares is itself expressible as 
a sum of two squares: 

if + $ 2 W 2 + k 2 ) = (jh- gk) 2 + (,fk + ghf. 

Also, in this identity, if deg / > deg g and deg/i > deg A:, then degf/fc — gk) 
> deg (fk + gh). Thus, we see that there are polynomials g(x) and h(x) with 
real coefficients, deg g(x) deg/i(x) such that f 2 = g 2 + h 1 . 


Problems 

4.2.7. Find polynomials F(x) and G(x) such that 

(X 8 - l)F(x) + (x 5 - l)G(x) = x - 1. 

4.2.8. What is the greatest common divisor of x" — 1 and x m — 1? 

4.2.9. Let /(x) be a polynomial leaving the remainder A when divided by 
x — a and the remainder B when divided by x — b, a=£ b. Find the 
remainder when/(x) is divided by (x — a)(x — b). 

4.2.10. Show that x 4 " + x 46+1 + x 4r+2 + x 4d+2 t a,b,c,d positive integers, 
is divisible by x 3 + x 2 + x + 1. (Hint: x 3 + x 2 + x + 1 = (x 2 + l)(x + 1).) 

4.2.11. Show that the polynomials (cos# + xsin#)" — cosn# — xsin/i# is 
divisible by x 2 + 1. 
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4.2.12. For what n is the polynomial 1 + x 2 + x 4 + ■ ■ ■ + x 2n 2 divisible 
by the polynomial l+x + x 2 +---+x n_l ? 

4.2.13. A real number is called algebraic if it is a zero of a polynomial with 
integer coefficients. 

(a) Show that tJI + V3 is algebraic. 

(b) Show that cos(jt/ 2«) is algebraic for each positive integer n. (Hint: Use 
de Moivre’s theorem to express cos me as a polynomial in cos*.) 


4.2.14. If P(x ) is a monic polynomial with integral coefficients and k is 
any integer, must there exist an integer m for which there are at least' k 
distinct prime divisors of P(m)1 (Hint: First prove, by induction, that there 
are k distinct primes q t , .... q k and k integers n u . .. , n k such that q j 
divides P(n,) for i = 1, .. ., k. Then prove that a prime q divides P(n) if 
and only if q divides P(n + sq) for all integers s. An affirmative answer 
follows from these facts together with an application of the Chinese 
Remainder Theorem.) 

4.2.15. 

(a) Factor x 8 + x 4 + 1 into irreducible factors (i) over the rationals, (ii) 
over the reals, (iii) over the complex numbers. 

(b) Factor x" — 1 over the complex numbers. 

(c) Factor x 4 — 2x 3 + 6x 2 + 22x + 13 over the complex numbers, given 
that 2 + 3i is a zero. 

4.2.16. Here are two results that are useful in factoring polynomials with 
integer coefficients into irreducibles. 

Rational-Root Theorem. If P\x) = a n x n + a iI _ l z" _l + ■ ■ • + a,x + a 0 is a poly¬ 
nomial with integer coefficients , and if the rational number r/s (r and s relatively 
prime integers) is a root of P(x) = 0, then r divides a 0 and s divides a n . 

Gauss’ Lemma. Let P(x) be a polynomial with integer coefficients. If P(x) can be 
factored into a product of two polynomials with rational coefficients, then P{x) can 
be factored into a product of two polynomials with integer coefficients. 

(a) Let f(x) = a n x" + a„_,x n_1 + - ■ - + a x x + a 0 be a polynomial of de¬ 
gree n with integral coefficients. If a 0 , a„, and/(l) are odd, prove that 
f(x) = 0 has no rational roots. 

(b) For what integer a does x 2 — x + a divide x 13 + J + 90? 

4.2.17. 

(a) Suppose f(x) is a polynomial over the real numbers and g(x) is a 
divisor of f(x) and /'(*). Show that ( g(x)) 2 divides f(x). (This fact can 
be used to check/(x) for multiple roots.) 
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(b) Use the idea of part (a) to factor x 6 + x 4 + 3x 2 + 2x + 2 into a 
product of irreducibles over the complex numbers. 

4.2.18. Determine all pairs of positive integers (m,n) such that 1 + *" + 
x In + ■ ■ ■ + jc™ is divisible by 1 + x + x 1 + • ■ ■ + x m . 

4.2.19. 

(a) Let F(x) be a polynomial over the real numbers. Prove that a is a zero 
of mutliplicity m if and only if F(a) = F'(a) =•■•= F^ m) (a) = 0 and 
F im+I \a)¥= 0. 

(b) The equation f(x ) = x" — nx + n — 1 = 0, n > 1, is satisfied by x = 1. 
What is the multiplicity of this root? 

4.2.20. If n > 1, show that (x + 1)" — x” — 1 = 0 has a multiple root if and 
only if n — 1 is divisible by 6. 

4.2.21. Let P(jc) be a polynomial with real coefficients, and assume that 
P(x) > 0 for all x. Prove that P(x) can be expressed in the form 
( Q\(x)f + ( &(*)) 2 + •••+( G„0)) 2 where 0K*), Qi(x ),. . ., &(*) are 
polynomials with real coefficients. 

4.2.22. 

(a) Set « = cos(2tt/«) + f sin(2w/«). Show that 

+ . . . + X + 1 = (X - «)(* - W 2 ) • • ■ (* - '). 

(b) Set x = 1 and take the absolute value of each side to show that 

sin — sin — • • • sin -— ^ 1 — . 

n n n 2” “ 1 


Additional Examples 
1.12.2, 1.12.5, 6.5.13, 6.9.3. 


4.3. The Identity Theorem 

Let P be a nonzero polynomial of degree n over an integral domain D. 
According to the factor theorem, if a is a root of P(x) = 0, there is a 
polynomial Q of degree n — 1 such that P(x) — (x — a)Q(x). Using this 
fact, an easy induction argument shows that P has at most n zeros. 

The preceding observation has a very important corollary. Suppose that 
F and G are polynomials over a domain D, each of degree less than or 
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equal to n, and suppose that F and G are equal for n + 1 distinct values. 
Then F — G is a polynomial of degree less than n + 1 with n + 1 zeros. If 
F — G is not the zero polynomial, we have a contradiction to the reasoning 
in the previous paragraph. Therefore, F - G is the zero polynomial, and it 
follows that F equals G (coefficient for coefficient). (For another proof, see 
6.5.10.) 

Identity Theorem. Suppose that two polynomials in x over an integral domain are 
each of degre < n. If these polynomials have equal values for more than n distinct 
values of ic, then the two polynomials are identical. 

43.1. Determine all polynomials P(x) such that P(x 2 + 1) = (P(.x)) 2 + 1 
and P(0) = 0. 


Solution. We start by testing some cases: 

P{1) = P(0 2 + l) = (F(0)) 2 + l = 1, 

P( 2) = P(l 2 + 1) = (P(1)) 2 + I = 1 + 1 = 2, 

P(5) = P(2 2 + 1) = (P(2)) 2 + 1 = 4 + 1 = 5, 

P( 26) = P(5 2 + 1) = (F(5)) 2 + 1 = 5 2 + 1 = 26. 

In general, define jc 0 = 0, and for n > 0 define x n = x 2 _, + 1. Then an easy 
induction argument shows that P(x„ ) = x„. Thus, the polynomial P(x) and 
the polynomial x are equal for an infinite number of integers, and there* 
fore, by the identity theorem, P(jr) = x. That is, there is only one polyno¬ 
mial with the stated property, namely, F(jc) = jc. 

4.3.2. Prove that if m an n are positive integers and 1 < k < n, then 

i 0 (*-,)(:)-rn 

Solution. We proved this identity in Chapter 1 (see 1.3.4) by using a 
counting argument. Here is another proof, based on the identity theorem. 
The technique is standard: the polynomials (1 + Jt) m (l + x)" and (1 + 
x) m + n are equal for all values of x. Therefore, by the identity theorem, 
their coefficients are equal; that is, for each k, the coefficient of x k in 
(1 + ;t)'"(I''4- x)" is equal to the coefficient of jc* in (1 + x) m+n . It follows 
that 
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433 . For each positive integer n, show that the identity 

(x+y) n = ^ t x, y positive integers 

implies the identity 

(x+y)' 1 = 2 f ?)x*y n_ *, x, y real numbers. 


Solution. Let y 0 be an arbitrary but fixed positive integer, and let 

*(*)-(*+ *>)". Q( x ) = 2 (JW-*. 

k- O'-*' 

P{x) and Q{x) are polynomials in x, and they are equal whenever x is a 
positive integer. Therefore, by the identity theorem, F(x) and Q(x) are 
equal for all real numbers x. 

Now, let x Q be a fixed real number, and let 

SOO = (*o + )0" and T(y) = £ (£)*oV“* 

S(y) and T{y) are polynomials iny\ and since they are equal whenevery is 
a positive integer, it follows that S(y) = T(y) for all real numbers y. This 
completes the proof. 

(Incidentally, the identity 

» - , 

(x + y) n = 2 {i r ) xk y' , ~ l ‘, x, y positive integers 

can be proved neatly as follows. Let S = {1,2, .... n}; let A be a set with 
x elements and B be a set, disjoint from A, withy elements. Now, count, in 
two different ways, the number of functions from S to A U B. This, 
together with the preceding solution, constitutes another proof of the 
binomial theorem.) 

43 . 4 . Is x 5 — x 1 + 1 irreducible over the rationals? 


Solution. By the rational-root theorem (see 4.2.16), the only possible ra¬ 
tional zeros are ±1, and neither of these is a zero. Therefore, if the 
polynomial is reducible, it must necessarily be the product of a quadratic 
and a cubic. So suppose 

x 5 - x 2 + 1 = (x 2 + ax + h)(x 3 + cx 2 + dx + e). 

By Gauss’ lemma (see 4.2.16), we may assume that a,b,c,d,e are integers. 
Since these polynomials are equal for all x, their coefficients are equal; so. 
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equating coefficients, we get the following equations: 

a + c = 0, 
b + ac + d = 0, 
c + ad + e — — 1, 
bd+ ae = 0, 
be = 1. 

It is not difficult to show these equations cannot hold simultaneously. For 
example, the last equation shows that b and e are both odd. Thus, the 
fourth equation shows a and d have the same parity. Similarly, the first 
equation shows that a and c have the same parity. Therefore a, c, and d 
have the same parity. But then ac + d is even, and the second equation 
cannot hold (b is odd). Therefore x 5 — x 1 + 1 is not reducible over the 
integers, or the rationals. 

Another way to proceed with the problem is based on the following 
observation. If /, g, and h are polynomials over the integers and / = gh, 
then f = |/i(mod n), where J g, and h are the polynomials formed from /, 
g, and h respectively by taking their coefficients modulo n. If f is reducible 
over the integers, then /is reducible over the integers taken modulo n. In 
the case at hand, the polynomial x 5 — x 2 + 1 transfoms to x 5 + x 2 + 1 
(mod 2). The only irreducible quadratic polynomial over Z 2 = {0,1} is 
x 2 + x + 1 (the other quadratic polynomials and their factorizations mod¬ 
ulo 2 are x 2 = x • x, x 2 + 1 = (x + l) 2 , and x 2 + x = x(x + 1)). But x 2 + 
x + 1 does not divide x s + x 2 + 1 in Z 2 (x 5 + x 2 + 1 = (x 3 + x 2 X* 2 + x + 
1) + 1 (mod 2)), and therefore, x s + x 2 + 1 is irreducible over Z 2 . It follows 
that x 5 — x 2 + 1 is irreducible over the integers, and the rationals. 

In the preceding discussion, we made use of the fact that polynomials 
over Z„ can be added, subtracted, and multiplied in the usual manner 
except that the arithmetic (on the coefficients) is done within Z„ (i.e. 
modulo n). If n is a prime number, say n = p, then Z p is a field, so all the 
results concerning polynomials over fields (e.g., the factor theorem, the 
identity theorem) continue to hold. This is not the case if n is not a prime. 
For example, 2x 3 - 2.x, as a polynomial over Z 4 , has four distinct zeros in 
Z 4 , namely, 0, 1, 2, and 3, whereas it would have at most three if the 
arithmetic were carried out in a field. 

Let p be a prime, and consider the binomial theoiem modulo p 

(i + X ' )P = fc 2 o (£)**( m « l />), 

where each side is regarded as a polynomial over Z p . For 1 < k < p — 1, 
we have (£) = 0 (mod p), since none of the factors in k\(p — ky. divide the 
factor of p in pi. Thus, as polynomials over Z p , 

(1 + */= 1 + x p (mod p). 
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More generally, for each positive integer n, 

(l + */"= 1 + x p " (mod p). 

The argument is by induction. It is true for n = 1, and assuming it true for 
k, we have 

(1 + xf‘* =(1 +x/(l + xy k ■ ■ ■ (1 +xy* (mod p) 

p times 

= (1 + x*’*)^ + x*’*) • • • (1 + x pi ) (mod p ) 

= (1 + x pt ) (mod p) 

= 1 + (x pt ) P (mod p ) 

= (1 + x pk *') (mod;?). 

By equating coefficients of x' on each side, we find that 
j = 0 (mod p), 1 < j < p”. 

43.5. Prove that the number of odd binomial coefficients in any finite 
binomial expression is a power of 2. 

Solution. A conjecture, based on the examination of several special cases 
(see 1.1.9), is that the number of odd coefficients in (1 + x)" is 2\ where k 
is the number of nonzero digits when n is expressed in binary notation. 

An example will make it clear how the proof goes in the general case. 
Consider n = 13. In binary notation, 13 = 1101 2 = 8 + 4 + 1. Therefore, 

(1 + x) l3 =(l +x) 8 + 4 + 1 

- (1 + X) 8 ([ + x) 4 (l +*) 

= (1 + x 8 )( 1 + x 4 )( 1 + x) (mod 2), 

making use of the previously established result. From this we can see that 
there are eight odd binomial coefficients in (1 + x) !3 . This is because when 
the right side in the preceding equation is expanded, (1 + x 4 )(l + x) will 
have four terms, and (1 + x 8 )(l + x 4 + x + x s ) will have eight terms. (In 
general, if 1 + x" is multiplied by a polynomial P(x) of degree smaller than 
n, the result will be a polynomial with twice as many nonzero coefficients 
as the corresponding number in P(x).) 

Consider the polynomial equation x 2 + ax + b = 0, and suppose its 
roots are /■[ and r 2 . Then we can write 

x 2 + ax + b - (x - /-,)(x - rj) 

= x 2 -(r l + r 2 )x + r i r 2 . 
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From this, using the identity theorem, it follows that 

r ( + r 2 = — a, 

r i r 2 = b. 

Similarly, if .r 3 + ax 1 + bx + c = 0 has roots r^,r 2 , we have 
x 3 + ax 2 + bx + c — (x — /■[)(* — — rf) 

= Jc 3 - (/■[ + r 2 + r 3 )x 2 

+ ( r i r 2 + 'T'h + r 2 r i) x - rMi . 

In this case, 

+ r 2 + r 2 = - a, 
r i r i + ''i'j + r 2 r j = b, 
rfi'3 = ~ c 

In each case, we have expressed the coefficients of the polynomial 
equation in terms of the roots (in a rather patterned way). An induction 
argument shows this is true in general: specifically. 

If x" + a n _ t* n-1 + ■•* + a[jc + <j o = 0 has roots r [7 r 2 , . . ., r n then 

51 = r, + r 2 + ■ • • + r„ = -<!„_[ , 

5 2 = r l r 2 + ■ • • + nr„ +r 2 r 3 + ■ • ■ + nr„ + ■ ■ • + r„-ir„ = o„_ 2 , 

$3 = ry 2 r 2 + r 1 r 2 r 4 + - ■ ■ + r 2 r 3 r 4 + • - ■ + r h - 2 r„-ir„ = -fl„_3, 

S„ = r t r 2 ■ - ■ r n = ( - 1)%,, 

where 5, is the sum of all the products of the roots taken i at a time. 

43.6. Consider all lines which meet the graph 

y = 2x 4 +.7 x 2 + 3x-5 

in four distinct points, say (x,, yj, i = 1,2,3,4. Show that 

x, + x 2 + x 3 + x 4 
4 

is independent of the line, and find its value. * 

Solution. Let y = mx + b intersect the curve in four points (x j7 _y ( ), i — 1,2, 
3,4. Then x l ,x 2 ,.*: ; j,x 4 are the roots of the equation 

mx + b = 2x 4 + 7x 3 + 3x — 5, 

or equivalently, of 

*4 + j,3 + (3^»)* + (^!)_ 0 . 

It follows from our earlier remarks that (x, + x 2 + x 3 + x 4 )/4 = (— j)/4 
= — |, and this k independent of m and b. 
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43.7. Let P be a point on the graph of f(x) = ax 3 + bx, and let the 
tangent at P intersect the curve y = f(x) again at Q. Let the x-coord in ate of 
P be x 0 . Show that the x-coordinate of Q is — 2x 0 . 

Solution. The straightforward approach is to write the equation of the 
tangent to the curve y = /(x) at P, sayy = T(x), and to so Ivey = 7"(x) and 
y = f(x) simultaneously to find Q. 

Another approach is to argue as follows. We recognize that solving 
y = T(x) andy = f(x) simultaneously is the same as finding the roots of 
f(x) - T(x) = 0. Now x 0 is a double root (that is, of multiplicity 2) of this 
equation, since T(x) is tangent toy = f(x) at x 0 . What we seek is the third 
root, denoted by x : . We know that the sum of the roots, 2x 0 -I- x 1( is equal 
to the coefficient of the ,x 2 term. But the coefficient of the x 2 term is 0, so it 
follows that x j = —2x 0 . 

43.8. Let x : and x 2 be the roots of the equation 

x 2 — (a 4- d)x + (ad — be) = 0, 

Show that x^ and x\ are the roots of 

y 2 — (a 3 + d 2 + 3 abc + 3 bcd)y + (ad — bc) 2 = 0. 

Solution. We know that 

x ( + x 2 = a + d, 
x,x 2 = ad ~ be. 

Since (x, + x 2 ) 3 = x 3 + 3x 2 x 2 + 3x x xj + x\, we have 
x 3 + x\ = (x! + x 2 ) 3 — 3x 2 x 2 — 3X!X 2 
= (a + d) 2 — 3x|X 2 (x, + x 2 ) 

= (a + d ) 3 — 3(ad — bc)(a + d ) 

= (a + d)[a 2 + lad + d 2 — had + 3 £c] 

= (a + d)(a 2 - ad + d 2 + 3 be) 

= a 3 + d 3 + 3 abc + 3 bed. 

Furthermore, 

XjX 2 = (ad - be) 3 , 

and the proof is complete. 

43.9. Let a,b,c be real numbers such that a + b + c = 0. Prove that 
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Solution. Here is a very clever solution based on the ideas of this section. 
Let A = ab + ac + be and B = abc. Then a , b, c are roots of the equation 

x 3 + Ax - B = 0. 

For each positive integer n, let T n = a n + b n + c". Then, 

To-3, 

T,= 0, 

T 2 = (a + b + c'f - 2(ab + ac + Ac) = - 2A. 

For n > 0, T n+2 = —AT n+ , + BT„ (substitute a,b,c into x" + 3 = —Ax n+l 
+ Bx” and add), and this gives 

T 3 -/IS, + BS 0 = 3b; 

T 4 = ~AS 2 + BS l = 2A\ 

T 5 = -AS 2 + BS 2 = -5AB. ' 

It follows that 



43.10. Show that the polynomial equation with real coefficients 
P(x) = a n x n + a„_ l x fl “ l + • ■ • + a 2 x 3 + x 2 + x + 1=0 
cannot have all real roots. 


Solution. Let r t ,r 2 ,...,r n denote the roots of F(jc) = 0. None of 
r lt . . . , r n is zero. Divide each side of P(x) = 0 by x n and setj = 1 /jt, to 
get . 

Q(y) = y" +y”~ 1 + y n ~ 2 + + • • • + a y y + a a = 0 . 

Note that r is a root of P(x) = 0 if and only if \/r is a root of Q( y) = 0. 
Therefore, the roots of Q(y) = 0 are s,,.s 2 , . . . , where s : = I//,, i = 

1.«. It follows that 

1-1 


and therefore. 


2 Vy = U 

><j 


S^=(i^) 2 -22Vy=l-2=-l. 

1=1 \:— 1 / i<j 

This equation implies that not all the j/s are real; equivalently, not all the 
r ,’s are real. 
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Problems 

43.11. Let k be a positive integer. Find all polynomials 
P(x) = a n x n + • ■ ■ + a t x + a 0 , 
where the are real, which satisfy the equation 

P(P(*))=[P(*)]*. 


43.12. 

(a) Prove that log* cannot be expressed in the form f(x)/g(x) where f(x) 
and g(x) are polynomials with real coefficients. 

(b) Prove that e* cannot be expressed in the form f(x)/g(x) where /(*) 
and g(x) are polynomials with real coefficients. 

43.13. Show that 

(l+*)'-.x(l+*)"+* ! (l+ *)"-+ ± **(>+*)” 

and use this identity to prove that 


43.14. 

(a) Differentiate each side of the identity 

(>+*)■- i (£)»*• 

*=o VK/ 

By comparing the coefficients of * t_1 in the resulting identity, show 
that 

"(*-!)-*(*) 

(b) Use the result of part a to show that 


43.15. Let .U” ) = *(* - 1) • • * (x — n + 1) for n a positive integer, and let 
x (0 > = 1. Prove that for all real numbers * and y 
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(Hint: This can be done by induction, but consider a proof similar to 4.3.3 
which first establishes the result for positive integers x and v\ For this, 
count in two different ways, the number of one-to-one functions from 
{1,2,. . . , n } into A U B, where A is a set with x elements and B is a set, 
disjoint from A , with y elements. Prove the identity for all real numbers by 
making use of the identity theorem.) 

43.16. Is x 4 + 3x 3 + 3x 2 — 5 reducible over the integers? 

43.17. Let p be a prime number. Show that 

(a) (V) = (“0* (mod p\ 0 < k < p- 1, 

(b) (/’t l ) = 0(mod p), 2 < k < p- 1, 

(c) ($) £ Q ( mod p),a>b>0, 

(d) (p = 2(mod^). 

43.18. Let w = cos(2tt/m) + i sin(27r/«). 

(a) Show that 1,«, w 2 ,. . . , w" _l are the ft roots of x" — 1 = 0. 

(b) Show that (1 — w)(l — w 2 ) • • • (1 — w n_1 ) = n. 

(c) Show that «+*•*+ «" _l = — 1. 


43.19. 

(a) Solve the equation x 3 - 3x 2 + 4 = 0, given that two of its roots are 
equal. 

(b) Solve the equation x 3 — 9x 2 + 23x 15 = 0, given that its roots are in 

arithmetical progression. 

43.20. Given r,s,t are the roots of x 3 + ax 2 + bx + c = 0. 

(a) Evaluate 1/r 2 + 1/s 2 + l/( 2 , provided that 0. 

(b) Find a polynomial equation whose roots are r 2 ,s 2 , t 2 . 

43.21. Given real numbers x, y,z such that 

x +y + z = 3, 
x 1 + y 2 + z 2 = 5, 
x 3 +y 3 + z 3 = 7, 

find x 4 +y 4 + z 4 . (Hint: Use an argument similar to that used in 4.3.9.) 

We close this section with three problems which draw attention to some 
additional results about polynomials that are very useful in certain prob¬ 
lems. 


43.22 (Theorem). If X[,x 2 ,. . *, x B are distinct numbers, and j,, - ,y„ 

are any numbers, not all zero, there is a unique polynomial /(x) of degree 
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not exceeding n — 1 with the property that /(x,) = y lt /(x 2 ) = y 2 , . . ., 
f{x„)=y n . 


Outline of Proof: 

(a) Let g(x) = (x — Xj)(x - x 2 ) ■ • • (x — x„). Show that 

g(x) / (*~ x 2 )(x ~ *3) ■ • • (x - x„) \ 

(x - x,)g'(-*i) { (x, - x 2 ) • * • (x, - x„) j 


is a polynomial of degree n — 1 with zeros at x 2 , . . ., x n and which 
equals 1 at x = x r 

(b) Lagrange interpolation formula. Show that 


/(*) 


£(-*) *(*) _ 
(X - X^g'fX!) (X - X 2 )g\x 2 ) yi 


+ (X - X„)£'(X„) y " 

takes the values y y 2 , . . . ,y„ at the points x,, . , . , x n respectively. 

(c) Application. Suppose that P(x) is a polynomial which when divided by 
x— 1, x — 2, x — 3 gives remainders of 3,5,2 respectively. Determine 
the remainder when P(x) is divided by (x — l)(x — 2)(x — 3). (Hint: 
Write P{x) — £?(x)(x — lKx — 2)(x — 3) + R(x), where ^i(x) is of de¬ 
gree less than 3. Find ft(x) by the Lagrange interpolation formula, 
since rt(l) = 3, R{2) = 5, rt(3) = 2.) 

43.23 (Partial Fractions). 


(a) Show that if /(x) is a polynomial whose degree is less than n, then the 
fraction 

_/tf)_ 

(x - x,)(x - X 2 ) • • . (x - x„) ’ 


where x ( ,x 2j . . . , x„ are n distinct numbers, can be represented as a 
sum of n partial fractions 


Ay 

x — x ( 



+ • • 


• + 


A 


n 


x — x„ 


where A lr ...,A„ are constants (independent of x). (Hint: Use 
Lagrange’s interpolation formula: divide each side by g(x), etc.) 

(b) Application. Let /(x) be a monic polynomial of degree n with distinct 
zeros x ( ,x 3 , .. ., x„. Let g(x) be any monic polynomial of degree 
n — 1. Show that 


Y gj*j) 

j -1 A*/) 


= 1. 


(Hint: Write g(x)/f(x) as a sum of partial fractions.) 
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43.24. A sequence of numbers u 0 ,u l ,u 2i . . - is called a sequence of fcth 
order if there is a polynomial of degree k, 

P(x) = a k x k + a k _ t x k ~ l +•••+«!* + a 0 

such that t/; = P(i) for i = 0,1,2, . . . . 

The first-difference sequence of the sequence « 0 , u,, u 2 ... is the se¬ 
quence u^ l \u\ li ,u^ ] \ . .. defined by 

= «„+, ~ u„, n = 0,1,2,3,... . 

(a) Prove that if u 0 ,u l ,u 2 , . . . is a sequence of order k, then the first- 
difference sequence is a sequence of order k — 1. Define the second- 
difference sequence of u 0 ,u lt u 2 ,. . . to be the first-difference sequence 
of the first-difference sequence, that is, the sequence w& 2 \wj 2> ,t4 2) , . . . 
defined by 


~ u n +2 2 h „ + , + u n , n 0 , 1 , 2 , . ., . 

From part (a) it follows that u^ 1} ,u\ 2) ,u ( 2 2 \ ... is a sequence of order 
k — 2. Similarly, define the third-difference sequence, the fourth- 
difference sequence, and so forth. Repeated application of part (a) 
shows that if w 0 ,w t ,u 2 , ... is a sequence of order k, the {k + l)st 
difference sequence will be identically zero. We aim to establish the 
converse: if the successive difference sequences of an arbitrary se¬ 
quence n 0 ,n[,W 2 , . . . eventually become identically equal to zero, then 
the terms of original sequence are successive values of a polynomial 
expression; that is, there is a polynomial P(jc) such that «„ = P(n), 
«- 0 , 1 , 2 ,... . 

(b) Use induction to prove that * 

».-(SK + (>r + ("K> + --- + CK>. 

(c) Suppose that the original sequence is described by the function F(x). 
That is, suppose that F(n) =«„,« = 0,1,2,.... For k = 0,1,2,3,. . ., 
let A*F(0) = uf* , and for x a real number and i a positive integer, let 
x {,) = x(x - l)(jc — 2) • ■ ■ (x - i + 1). Show that the result of part (b) 
can be written in the form 


« A*F(0) 

m- 2 -rr 1 ""- 

k-0 K - 

Note the similarity to the Taylor expansion of F(x): 


F{x) = 2 (/^O)/*!)**. 


(d) Prove that if the (k + l)st difference sequence is identically zero, then 
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the original sequence is given by 


**(«) 


k 


2 

/ = 0 


am.* 


(e) Use the result of part (d) to find a closed formula for the sum of the 
series 1* + 2 4 + ■ • • + n 4 . (Hint: Notice that the first-difference se¬ 
quence is given by a polynomial of degree 4, and therefore, the sum will 
be a polynomial of degree 5.) 


Additional Examples 

4.4.30, 4.4.31, 7.2.10, 8.2.2, 8.2.3, 8.2.10, 8.4.11. 

4.4. Abstract Algebra 


A group is a set G together with a binary operation * on G such that: 

(i) Associative property. For all elements a, b,c in G 

( a*b)*c = a*(b*c). 

(ii) Identity. There is a unique element e in G (called the identity of G) 
such that for every element a in G, 

a*e - a = e*a. 

(iii) Inverse. For each element a in G, there is a unique element a ~ 1 in G 
(called the inverse of a) such that 

a~ } *a = e = a*a~ } . 

When working with groups, we sometimes think of the operation * as 
“multiplication,” and in this case we often suppress the * in writing 
products. Thus, a*b is written simply as ab, and a*(b*c) is written as a(bc), 
or abc, and so forth. Furthermore, when we think of * as a product, we 
sometimes denote the identity element as “1.” In addition, we use exponen¬ 
tial notation to simplify expressions; e.g., a 4 =■ aaaa, etc. It is not difficult 
to show that the usual laws of exponents hold in a group, namely, 

a n a rH -a n+m J (a n ) m = a™, n,m integers. 

The group operation need not be commutative; i.e., it may not be the 
case that ab = ba for all elements a, b of G. An example of such a group is 
the set of n-by-n nonsingular matrices over the real numbers. 

In any group G, it is the case that 

(ab )~ 1 = 


a,4e G. 
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This identity is fundamental and can be proved in the following way. 
Observe that {ab){b~'a ~ l ) = a(b{b~ l a -1 )) = a((bb " l )a “ l ) = a(ea~') 
= aa~ l = e, and (b~ l a~ l ){ab) = b~\a~\ab)) = b~\{a~ l a)b) = b ~\eb) 
= b~'b = e. Therefore b~ l a _i is an inverse for ab. But ab has a unique 
inverse, denoted by ( ab)~ l . It follows that (ab)~ l = b~ l a~\ 

If the group G is commutative (i.e., if ab = ba for all a,b e G), it is easy 
to show that 

(af>)" = a"b n , a,b EC n an integer. 


4.4.1. Suppose that G is a set and * is a binary operation on G such that: 

(i) Associative property. For all a,b,c in G, a*(b*c) = {a*b)*c\ 

(ii) Right identity. There is an element e in G such that for every element a 
in G, a*e = a; and 

(iii) Right inverse. For each element a in G, there is an element a~ l \n G 
such that a*a ~ 1 = e. 

Prove that G is a group. 


Solution. We will show that the right identity e is also a left identity, and 
the right inverse a -1 is also a left inverse for a. Then we will show that e 
and a ~ 1 are unique. 

Observe that a ~ 1 is an element of G, and therefore by (ii), there is an 
element (a -1 ) -1 in G such that (a~ ')*(«“ ! ) _ 1 = e. We now compute 

a~ 1 a = (a- l a)e = (a- , a)(a- ‘(a -1 ) - ') 

= n- 1 [(o«- l )( a - | )“ 1 ] 

= '] = (a- l e)(a- 1 )' 1 

= e. 

This shows that a -1 is an inverse (left inverse and right inverse). 

Also, ea = (aa~ l )a = a(a~ l a) = ae = a, and therefore e is an identity 
for G (that is, for each a, ea = a ~ ae). 

Suppose e' is also an inverse for G. Then e = e*e' (because e' is an 
identity) = e’ (because e is an identity). This shows the identity element of 
G is unique. 

Suppose (a~ l )' is also an inverse for a. Then = (a -1 )' 

( aa~ l ) = [(a l )'a]a~ ] = 1 = a~ l . This shows the inverse of a is unique. 

It follows that G is a group. 
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4.4.2. Let G be a group. 

(a) Cancellation property. For all a,b,c in G, show that 

ab = ac implies b = c, 
ba = ca implies b = c. 

(b) Let a be an element in G , and consider the sequence 

1 ,a,a 2 ,a 2 , .... 

Show that either all the elements in the sequence are different, or there 
is a smallest integer n such that a" = 1 and 1 ,a, . . . , a"~ l are distinct. 
In the latter situation, n is called the order of a , denoted by ord(a); in 
the former case we say that a has infinite order. 


Solution, (a) This follows immediately by multiplying each side on the left 
(and right respectively) by a~ l . 

(b) Suppose that not all elements in the sequence are different, and let n 
be the smallest integer such that a ” is a repetition of a previous element in 
the sequence. Then a" = 1, for if a" = a‘, 0 < i < n, then by the cancella¬ 
tion property, o' 1-1 = o'" 1 , and this contradicts our choice of n. 

4.43. Let a and b be two elements in a group such that aba = ba 2 b, 
a 3 = e, and b 2n ~ 1 = e for some positive integer n. Prove that b = e. 

Solution. Note that if ab = ba, then aba = ba 2 b is the same as a l b = a 2 b 2 , 
and the cancellation property implies b = e. Although the group may not 
be commutative, we shall prove that this particular set of equations for a 
and b does imply that ab = ba. 

Notice that ab = ba is the same as ab 2n = b 2n a, since by assumption 
b 2 " = b. To show that ab 2n = b 2n a it suffices to show that ab 1 = b 2 a, since 
ab 2 " = a(b 2 )" = ( b 2 )"a (by repeated application of ab 2 = b 2 a) = b 2n a. 

Thus, the proof is complete after observing that ab 2 = {aba){a ~ 'b) 
= (ba 2 b)(a~ { b) = (ba 2 )(ba~ l b) ^ (ba 2 )(ba 2 b) = (ba 2 )(aba) = ba 3 ba = b 2 a 
(since a 3 = e). 

Let G be a group. We say that H is a subgroup of G if H is a subset of G 
which is itself a group (under the operation of G). The order of H is defined 
to be the number elements in H, and this number is denoted by ord (H). 

An important class of subgroups are the following. Let oeC, and let 

{a} = {a" : n is an integer}. 

It is easy to check that (a) is a subgroup of G; it is called the cyclic 
subgroup generated by a. Note that ord(a) = ord«< 2 ». 

The following theorem constitutes one of the most important results in 
the theory of finite groups. 
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Lagrange’s Theorem. If H is a subgroup of a finite group G, then the order of H 
divides the order of G. 

Here are three important corollaries. 

(i) If G is a group of order n and tiEG, then a" = 1. 

(ii) If G is a group of order p, where p is a prime, then G is a cyclic group 
(i.e., G = <a> for some a e G). 

(iii) If G is a group and a" = 1, then the order of a divides n. 

We will leave the proof of Lagrange’s theorem as a problem (see 4.4.18); 
however, it is instructive to see the arguments for the corollaries. 

Proof of (i). Let a £ G and let m = ord(a). By Lagrange’s theorem m 
divides n, 50 suppose n = mq for some integer q. Then a" = a mq = ( a m ) q 
= L = 1. 

Proof of (ii). Let a be an element of G different from the identity. Then 
<a> is a subgroup of G with more than one element (namely, 1 and a). By 
Lagrange’s theorem, the order of <a> divides p, but since p is prime, it must 
be the case that (a) is of order p; that is, (a) = G. 

Proof of (iii). Let m = ord(a). By the division algorithm there are inte¬ 
gers q and r such that n = qm 4 - r, 0 < r < m. Thus 1 = a" = a tpn+r 
= ( a m ) <! a r = a r . Since l,o, . . ., a m ~ 1 are distinct, it must be the case that 
r = 0 , and it follows that m divides « (this is a typical application of the 
division algorithm for integers). 


4.4.4. If in the group G we have a 5 = 1, aba 1 = b 2 for some a,b e G, 
find ord(fc). 


Solution. Since a 3 = 1 , the order of a is either 1 or 5. If ord(a) = 1, then 
a = 1 and it follows that b = b 2 , or b = 1 , apd so ord(fc) = 1 . 

Suppose ord(a) = 5. We have (aba~ l )(aba -l ) = (b 2 ) 2 , or equivalently, 
ab 2 a~ l = b 4 . Substituting aba~ 1 for b 2 on the left side of this equation 
yields a 2 ba~ 2 = b 4 . Squaring this, we get (a 2 ba~ 2 )(a 2 ba~ 2 ) = (b 4 ) 2 , or 
equivalently, a 2 b 2 a~ 1 = b B . Again, substituting aba~ 1 for b 2 on the left, we 
get a 2 ba~ 2 = b s . Squaring gives a 2 b 2 a~ 3 = b 16 , and substituting gives 
a 4 ba~ 4 = A 16 . One more time: a 4 b 2 a~ 4 = b 32 , or equivalently, a 5 ba~ 5 = b 32 . 
But a 5 = a~- = 1, so b = b 32 , and on cancellation, we get b 3 ' = 1. Since 31 
is a prime number, the order of b is 1 (if b is the identity) or 31. 


4.4.5. If G is a finite group and m is a positive integer relatively prime to 
the order of G, then for each a in G there is a unique b in G such that 
b m = a. 


Solution. Let T:G-*G be defined by T(x) = x m . We aim to show that T is 
a one-to-one function. So suppose that T(x) » T{y) for elements * and y 
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of G. Then x m = y m . Let n = ord(G). Since n and m are relatively prime, 
there are integers s and t such that sn + tm = 1. Hence x = x sn+rm 
= (x n y(x m y = (x m )' (since x" = 1) = (y m y (since x m = y m ) ~ (_y") s 
(. y m y (since y” = 1) = y s " +rm =y. 

Therefore T is a one-to-one function, and since G is a finite set, T is onto 
G. That is, for a E G, there is a unique b in G such that T(b) = a 
(equivalently, b m = a). 

The first corollary to Lagrange’s theorem states that tf ordlG) = 1 for each 
element in the finite group G. This has a number of interesting and 
important consequences when applied to particular groups. For example, 
let V„ denote the set of positive integers less than n that are relatively prime 
to n. The elements of V„ form a group under multiplication modulo n. Let 
qp(n) = ord(K„). (The function tp is called the Euler ^-function.) Then 
Lagrange’s theorem implies the following. 

Euler’s Theorem. If a is any integer relatively prime to n, then 

a*("> = 1 (mod n). 


When n is a prime number, say n = p, we have <p(/>) = p — 1, so that 
a p ~ 1 = 1 (mod p) whenever a is not a multiple of p. If we multiply each 
side by a, we get a p = a (mod />). This congruence holds even when a is a 
multiple of p, and thus we have the following result. 

Fermat’s Little Theorem. If a is an integer and p is a prime, then 

a p = a (mod p). 


4,4,6, Prove that each prime divisor of 2 P — 1, where p is a prime, is 
greater than p. (It is a corollary that the number of primes is infinite.) 


Solution. The result is true for p = 2, so henceforth assume that p is odd. 
Suppose that q is a prime that divides 2 P — 1. Then q is odd and 2 P = 1 
(mod q). By Fermat’s little theorem, 2 ?_1 = 1 (mod q). If q = p we have 
2 = 2 xl= 2 x 2 ' ?- 1 = 2 ? = 2 p =l (mod q), a contradiction. q < p, then 
q — 1 and p are relatively prime, so there are integers $ and t such that 
sp + t{q — 1)= 1. It follows that2 = 2 sp+, ^~^ =(2 p Y(2i- l y = 1 (mod q), 
a contradiction. Thus q must be larger than p. 


4.4.7. Show that if n is an integer greater than 1, then n does not divide 
2 * - 1 . 


Solution. Suppose that n divides 2" — 1 ; that is, 2" = 1 (mod n). Clearly, n 
is an odd number, since 2" — 1 is odd. Suppose that p is a prime divisor 
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of n. Then 2 n = 1 (mod p). Now, regard 2 as an element of the group V p . 
We know that 2 p ~ l = 1 (mod p) (Fermat’s little theorem, since gcd(2,/t) 
= 1). By the third corollary to Lagrange’s theorem, p — I divides n. So far 
there is no contradiction. However, suppose p is chosen as the smallest 
prime which divides n. Then these same conclusions hold, but now, the fact 
that ord( 2 ) divides n and ord( 2 ) divides p — 1 produces a contradiction to 
our choice of p. Therefore, « can never divide 2" — 1. 


4,4.8. Show that for any positive integer n there exists a power of 2 with a 
string of more than n successive zeros (in its decimal representation). 


Solution. For any positive integer s, there exists a positive integer t such 
that 2' = 1 (mod 5 1 ) (for example, take t = <p(5 J )). Let s — 2n. There exist 
positive integers q and r such that 2 r — 1 = q X 5 2 ". Multiply each side by 
2 2 ", rewrite as 

2 r+ln = 2 ln + qX 10 2 ”, 

and notice that 2 r+in has at least n consecutive zeros in its decimal 
representation, since 2 2n < 10 ”. 


4.4.9. Given positive integers a and b, show that there exists a positive 
integer c such that infinitely many numbers of the form an + b (n a 
positive integer) have all their prime factors < c. 


Solution. The result is obviously true when a = 1, so suppose a > 1. First, 
consider the case in which gcd(a, b) = 1. We will prove there are an infinite 
number of terms of the arithmetic sequence an + b among the terms of the 
sequence (a + b) k , k = 1,2 ,3,. . . . ■ 

From Euler’s theorem, b* <a) = 1 (mod a), since b is relatively prime to a. 
It follows that for each positive integer s, 

(a + b) MaHl =b^ (a)+l = (b^)b = b (mod a). 

This means that for each positive integer s there is an integer q, such that 

(a + by via)+l =q s a + b. 

It follows that each of the terms q s a + b, x = 1,2,3, .. . has only those 
prime factors that occur in a + b. 

Now consider the case in which gcd(n,h) = d > 1. Then gcd {a/d,b/d) 
= 1 , so from our preceding argument, there is a c such that infinitely many 
members of the sequence ( a/d)n + ( b/d ) have all their prime factors < c. 
From this it follows that infinitely many members of the form an + b have 
all their prime factors < cd. This completes the proof. 
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A ring is a set R with two binary operations, + and • , such that 

(i) R is a commutative group with respect to the operation + ; 

(ii) For all a, b, c in R, a{bc) = (ab)c (“ ■ ” suppressed); 

(iii) For all a, b,c in R, 

a(b + c) = ab + ac, 

(b + c)a = ba + ca. 

R need not have a multiplicative identity: if it does, we say R is a ring 
with identity. The multiplication in R need not be commutative: if it is, we 
say that ii is a commutative ring. 


4.4.10. Let a and b be elements of a finite ring such that ab 2 = b. Prove 
that bab = b. 


Solution. Obviously, if the ring were commutative the result would be 
immediate, but we must show the result holds even when the ring is 
noncommutative. In addition, we cannot assume the ring has a multiplica¬ 
tive identity. 

Suppose b = b 2 . Then bab = bab 2 = b 2 = b, and we are done. Sup¬ 
pose b = b m for some integer m > 2. Then bab = bab m = b(ab 2 )b m ~ 2 
= b 2 b m ~ 2 — b m = b, and we are done. Therefore it is sufficient to show 
that b — b m for some integer m > 2 . 

Suppose the ring has n elements. By the pigeonhole principle, at least 
two elements in the sequence b,b 2 , . . ., b H ,b n + ] are equal. Let i be the 
smallest integer such that b' equals some subsequent power of b in the 
preceding sequence; that is, b ' = b 1 + b, 1 < i < i + j < n + 1. Suppose 
i > 1. Then multiply each side of ab 2 = b on the right by b' +J ~ 2 to get 
ab l+j = b' + -i~\ But since b‘ = b‘ +J , we have ab' = From here there 

are two cases to consider. 

Suppose i = 2. Then b = ab 2 = 1 (from the last equation), and this 

contradicts our choice of i. So, suppose / > 2. Then b‘~ l = b-b'~ 2 
= (ab 2 ) X b‘~ 2 = ab‘ = ab ,+ b~\ which again contradicts our choice of i. 
Therefore, i = 1; that is, b = b J for some j. By the argument in the first 
paragraph, the proof is complete. 

An integral domain D is a commutative ring with unity in which for a, b 
in D, ab = 0 implies a = 0 or b = 0. The cancellation property holds in an 
integral domain. For, suppose ab = ac and a #0. Then a(b — c) = 0, so 
b — c = 0, or equivalently, b = c. Similarly, ba = ca, a ¥= 0, implies b = c. 

A field is a commutative ring with identity in which every nonzero 
element has a multiplicative inverse. 
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4.4.11. Show that a finite integral domain (an integral domain with only a 
finite number of elements) is a field. 

Solution. We must show that every nonzero element of the integral domain 
has a multiplicative inverse. So, let D* = {«,, ...,«„} be the nonzero 
elements of the integral domain, and consider an arbitrary element a of D*. 
Define T : D* -» D* by T(a,) = aa r If T(a t ) * T(aj) then aa i = aa Jt so by 
the cancellation property, a i = a } . Thus we see that T is a one-to-one 
function. Since D * is finite, the mapping T is onto D*. But one of the 
elements in D* is the multiplicative identity, denoted by 1. Therefore, 
T{a k ) = 1 for some a k £ D*\ that is, aa k = 1. This shows that a has a 
multiplicative inverse. 

Problems 

4.4.12. Let G be a set, and * a binary operation on G which is associative 
and is such that for all a,b in G, a 2 b = b = ba 2 (suppressing the *). Show 
that G is a commutative group. 

4.4.13. A is a subset of a finite group G, and A contains more than 
one-half of the elements of G. Prove that each element of G is the product 
of two elements of A . 

4.4.14. Let H be a subgroup with ft elements of a group G. Suppose that G 
has an element a such that for all a in H, (jcu ) 3 = 1, the identity. In G, let P 
be the set of all products x t ax 2 a ■ ■ ■ x n a, with n a positive integer and the 
A', in H. Show that P has no more than 3 h 2 elements. 

4.4.15. If a~ } ba=b~ l and b~ } ab = a~ l for elements a,b of a group, 
prove that a 4 = b* = 1. 

4.4.16. Let a and b be elements of a finite group G. 

(a) Prove that ord(u) = ord(u _i ). 

(b) Prove that orA(ab) = ord( 6 zr). 

(c) If ba = a 4 b 3 , prove that ord(a 4 b) = ord(aV). 

4.4.17. Let a and b be elements of a group. If b~ l ab = a k , prove that 
b~ T a s b r = a sk ' for all positive integers r and s. 

4.4.18 (Outline for the proof of Lagrange’s Theorem). Let G be a finite 
group and H a subgroup with m distinct elements, say H — {\,h 2 , 
A 3 , . . ., h m ). For each a e G, let Ha = {a,h 2 a,h 3 a, . .., h m a}. 

(a) Prove that Ha contains m distinct elements. 

(b) Prove that Hh i = H. 

(c) If b & Ha, prove that Ha and Hb are disjoint sets. 

A 



152 


4. Algebra 


(d) Prove that there are elements a,,a 2 , . .. , a k in G such that G = Ha x U 
Ha 2 U - - ■ U Ha k and Ha t n Haj = 0 if i ^ j. 

(e) Use the previous results to formulate a proof of Lagrange’s theorem. 

4.4.19. Find the smallest integer n such that 2" - 1 is divisible by 47. 

4.4.20. Prove that if p is a prime, p > 3, then ab p - ba p is divisible by 6p. 

4.4.21. Let a and b be relatively prime integers. Show that there exist 
integers m and « such that a m + b" = 1 (mod ab). 

4.4.22. If a,b,c,d are positive integers, show that 30 divides a 4b+<i ~ 

a 4c + d 

4.4.23. Let T„ — 2" + 1 for all positive integers. Let <p be the Euler <p- 
function, and let k be any positive integer and m = n + k<p(T n ). Show that 
T m is divisible by T n . 

4.4.24. Prove that there exists a positive integer k such that k2 n + 1 is 
composite for every positive integer n. (Hint: Consider the congruence class 
of n modulo 24 and apply the Chinese Remainder Theorem.) 

4.4.25. A Boolean ring is a ring for which a 2 = a for every element a of the 
ring. An element a of a ring is nilpotent if a" = 0 for some positive integer n. 
Prove that a ring R is a Boolean ring if and only if R is commutative, R 
contains no nonzero nilpotent elements, and ab(a + b) = 0 for all a, b in R. 
(Hint: Show that a 4 — a 5 = 0, and consider (x 1 — ,x 3 ) 2 .) 

4.4.26. Let R be a ring with identity, and let aefi. Suppose there is a 
unique element a' such that aa = 1. Prove that a’a = 1. 

4.4.27. Let R be a ring with identity, and a be a nilpotent element of R 
(see 4.4.25). Prove that 1 — a is invertible (that is, prove there exists an 
element b in R such that 6(1 — a) = 1 = (1 — a)b). 

4.4.28. Let R be a ring, and let C = {x e R: xy = yx for all v in R ). 
Prove that if x 2 — x E C for all x in R, then R is commutative. (Hint: Show 
that xy + _yx e C by considering x + y, and then show that x 2 E C.) 

4.4.29. Let p be a prime number. Let J be the set of all 2-by-2 matrices 
(“ b d ) whose entries are chosen from {0,1,2, .. . ,p — 1} and satisfy the 
conditions a + d = 1 (mod p), ad — be = 0 (mod p). Determine how many 
members J has. 

4.430. Let p be a prime number, and let Z p — {0,1,2, — 1}. Z p is a 

field under the operations of addition and multiplication modulo p. 

(a) Show that 0,1 I are the zeros of x p — x (considered as a 

polynomial over Z p ). Conclude that x p — x = ,x(x — IX* — 2) • • • 
(x-(p- 1)) (mod/*). 
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(b) Wilson's theorem. From part (a), show that 

{p - 1)!= - 1 (mod p). 

(c) Consider the determinant \a^ of order 100 with a- = i ■ j. Prove that the 
absolute value of each of the 100! terms in the expansion of this 
determinant is congruent to 1 modulo 101. 

4.4.31. Let F be a finite field having an odd number m of elements. Let 
p{x) be an irreducible polynomial over F of the form x 2 + bx + c, b, c e F. 
For how many elements k in F is p(x) + k irreducible over FI 


Additional Examples 
1.1.5, 1.1.12. 



Chapter 5. Summation of Series 


In this chapter we turn our attention to some of the most basic summation 
formulas. The list is quite short (e.g., the binomial theorem, arithmetic- and 
geometric-series formulas, elementary power-series formulas) but we shall 
see that a few standard techniques (e.g. telescoping, differentiation, integra¬ 
tion) make them extremely versatile and powerful. 


5.1. Binomial Coefficients 


Here are some basic identities; we are assuming that n and k are integers, 
n > k > 0. 


Factorial representation : 


* k 

ii 

(1) 

Symmetry condition: 


«)-(.-*)■ 

(2) 

In-and-out formula: 



(3) 

Addition formula : 


(iHVM;:!)- 

( 4 ) 
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The next formula is obtained by repeated application of the addition 
formula. 

Summation formula : 

(oH"T 1 ) + '" + ("£ t H" + £ +l )' < 5 > 

Sums of products (see 4.3.2 and 1.3.4): 

(xmi )(.:.) + - + (X)-r:*)- < 6 > 

Binomial theorem (see 2.1.1, 2.1.11, 4.3.3): 

£( n k ) x y~ k -( x+ y)'- ( j ) 


5.1.1. Use the summation formula to show that 


(a) 1+2 + 3+ *- * + n = 


n(n + 1) 


(b) l 2 + 2 3 + - ■ * n 2 = 


«(« + l)(2/j + 1) 


Solution, (a) We have 


i +2+ ... + „„(!) + ( 2 ) + ( 3 ) + ... + (») 

-(i) + M) + - + (.-.) 




(b) We first look for constants a and b such that 

e=a Ci) +b (\ )-»-^V^ +wr 

for k = 1,2, . .. , n. Think of each side as a polynomial in k of degree 2. 
The identity will hold if and only if the coefficients of like powers of k are 
equal; that is, if and only if 

I « a/2, 

0= -a/2 + b. 



156 


5. Summation of Series 


This yields a = 2 and b = 1 . It follows that 

= Ki) + (l)] + [f2) + 0)] + '-' + W2)-(")] 

-[(^( 3 2) + -" + (2)] + [(i) + (?) + - + a)] 

„ («+ !)(«)(" “ 1 ) («+ ')" 

= 2 —- z -+-— 

n(n + l)(2n + 1) 

= 6 “ 1 

(Another approach for part (b) is given in 5.3.11.) 

The preceding sums occur so often that it is desirable to memorize them 
or in some way be able to recall them easily. One way to remember the first 
formula is shown in Figure 5.1 (for n = 5). 

The diagram also prompts the following argument for the general case. 
Let S denote the sum of the first n positive integers. Then 

S = 1 + 2 +■*•+«, 

S = n +(/t - l) + ■ • • + 1. 
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Adding, we get 

2S = (n + 1) + (n + I) + - * • + (n + 1) 
= n(n + 1), 


and it follows that 


S = 


n(n + 1) 
2 


The technique of evaluating a sum by rearranging the terms is a 
common one. In particular, when the terms are represented as a double 
summation, it is often advantageous to exchange the order of summation. 
The next example is an illustration of this idea. 


5.1.2. Sum 




Solution. The terms of this sum are indexed by ordered pairs (i,y), where 
(/,_/) vary over the elements in the following triangular array: 


23 4 


0 • 

] * * 

2 * * * 

3 * * * * 

4 * * * * * 


In the given sum, the elements are first added columnwise. When we 
interchange the order of summation, so that the terms are first added 
rowwise, the sum is expressed in the form 



or, equivalently. 
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This is easily evaluated in the following manner. According to the binomial 
theorem, 

When x = 1, we get 



The original sum is therefore 

£(•)*. 

i-O v '' 

which by the binomial theorem is (I + 2)" = 3". 


5.13. Sum the following: 

( a,(») + 2(;) + 3(^) + ... + „(-), 

< b >'4("MG) + "-;TTT«)' 

Solution. The first sum is 

1,0 

Our aim, in summations of this type, is to use the in-and-out formula to 
bring the index of summation “inside” the binomial coefficent. Since 


it follows that 


and therefore 


(;)-?(;:,')■ 

'CM-:, 1 )- 

-SC:') 


= n x 2"" 
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The second sum is 


SrM7)- 

in 


and therefore, 


V = Y 1 f"+ M 

= -L_ v(" + M 
«+ i “ 0 l i + i / 

= _Lt 2" + 1 _ i], 
n + 1 L J 

There is another instructive way to handle these sums, based on differen¬ 
tiating and integrating each side of 

s (;)*'-<>+*)■• 

i -0 v ‘ ' 

For part (a), we differentiate to get 


and, with x = 1, we get 




= nxr 1 . 


For part (b), we integrate to get 

,“oW i + 1 fl + I 

When x = 0, the left side of this equation is 0, and this implies that 
C = — 1 /(n + 1). Thus, when x = 1, we get (as before) 
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5.1.4. Show that 

(7) - i (3) - i (5) — * t- ir- ^ i + ■ ■ ■ + i - 


Solution. The left side of the identity looks like the definite integral of a 
binomial series, and this provides the idea for the following argument: 

1 -( 2 )*’+<;)*’- •••■ 

We are now set up to integrate each side from 0 to 1, and we get 


To finish the problem, we must show the integral on the left is equal to 
1 + 1/2 + 1/3 + • * * + 1/n. Let / = 1 — x. Then 


I 


■ 1 - (i 


?L ix =f"—yl dy 

Jo \~y 

= ['(, +jk+/+.../-'Hk 

= y+ iy+ . . - +-V ] 1 

L “Jo 


The problem can be done without calculus, using the basic identities of 
this section, but it is technically harder. However, since it is instructive, we 
will sketch the idea. 

First, by repeated use of the addition formula and the in-and-out 
formula, we have, for n > > 1, 



(" 7 1 ) + (;:f)]-H" 7 ') + iO) 
M"7 2 ) + ^(VKQ 


and continuing in this way, we get 
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Therefore, 


I, (-r'K")- 1 , TTJ 

When we interchange the order of summation, we obtain 

‘« -J 


n- t 

s 




Let k = n - j, so that the right side is 

n k 


2 2(-i)"‘()U-2 I 2(-')'*'(*) 

t = I i—l \ I / * £=1 * /=] Vi/ 


= 2 


(see 5.1.9(a)). 


5.1.5. Sum 


«— I «+t / . \ \ 

2 2 ‘ 

i = 0 j = i +I \ } /\*/ 


Solution. This can be evaluated using the basic identities of this section; 
however, we want to illustrate another technique. Although this approach 
will seem artificial and unlikely, the fact is that the thinking is not as 
unusual as it might at first appear. The idea is to interpret the sum in 
probabilistic terms, in the following manner. 

Multiply the sum by l/2 Zn + 1 and write it in the form 

n — I 

2 

1 = 0 

Now consider the following matching game between players A and B. 
Player A flips n + 1 coins and keeps n of the coins to maximize the number 
of heads. Player B flips n coins. The player with the maximum number of 
heads wins, with ties awarded to B. 

Observe that the above sum represents the probability that A wins. We 
will now calculate this probability in another way. 

The game is equivalent to the following. Let A and B each flip n coins. 
The player with the most heads wins. If they each have the same number of 
heads, but not all heads, A flips the (n + l)st coin, winning if it is heads 
and losing if it is tails. At this point, A and B have equal chances of 
winning. 

In the remaining case, both A and B have all heads. In this case B wins 
regardless of ^’s last toss. Thus, B wins in exactly 2 more cases than A. 
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That is, out of the 2 2n+1 total flips, B wins in the 2 cases last described, and 
B wins in exactly one-half of the other cases ({ (2 2 " +1 —2)). Thus, the 
probability that A wins is 


1 — Pr(B wins) 


2 + i(2 2H+1 -2) 


- 2 - 2 2 " + 1 
2 2 " + 1 


2 2 " — 1 
2 2 " + 1 


It follows that the original sum is 2 2n — 1. 


Problems 

5.1.6. 

(a) Sum all the numbers between 0 and 1000 which are multiples of 7 or 
II. 

(b) Sum all the numbers between 0 and 1000 which are multiples of 7, 11, 
or 13. 

5.1.7. 

(a) Prove that for any integer k > 1 and any positive integer n, n k is the 
sum of n consecutive odd numbers. 

(b) Let n be a positive integer and m be any integer with the same parity as 
n. Prove that the product mn is equal to the sum of n consecutive odd 
integers. 

5.1.8. Use the summation formula (5) to sum (a) _ ( k 3 ; (b) 

5*1.9. Sum each of the following: 

«'-(l) + (2)-(3)-- + <-')-© 

(b) lX2(") + 2x3(") + . . + („-l)„(»). 

W (iM'iMsh-+"■(:)■ 

(“) (7) - 2 2 ( 2 ) + 3 2 (") — + (-ir*'" 2 (")- 

(•)(S)-5(T)4C2)-- + (-')‘du(:)- 

(t > &MA)/,«?«/]■ 

5.1.10. 

(a) What is the probability of an odd number of sixes turning up in a 
random toss of n fair dice? (To evaluate the sum, consider {[(* + yf — 

(x-yri) 
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(b) Show that if « is a positive multiple of 6, 

(■MXX)- 

CHGHG)- 

5.1.11. Prove the following identities: 

(!) (!) . ( 5 ) 


= 0 , 


= 0 . 


( a ) 


1x2 2x3 3x4 


■■■+("•) 


09 


n(n + 1) 


= 1 + 1 + 
2 3 


(!) KM!) 


« + 1 ’ 




a (i) 

(« + I) 2 


1 

n + 1 


1 + 2 + ' 


5.1.12. Show that 


<x)(x(x;.)x)(xx-xx;x(x)- 

p»(!) ,+ (T) ,+ (;) 1 + - + c) , -( 2 :)- 


5.1.13. Use the identities of this section to show that 


V r 2* f«\l 2 2/2«-2\ 

t=o[ n \*/J n l n — 1 / 


5.1.14. Sum 



(Hint: For i = n,n + 1, . .. , 2h — 1, compute /’(£), the probability that i 
tosses of a fair coin are required before obtaining n heads or n tails.) 


5.1.15. A certain student, having just finished a particularly hairy summa¬ 
tion, stared glassy-eyed at an “x l y 2 ” which was written on the scratch 
paper. After some doodling the student wrote: 

(0 *l/ 2 / 3*4 

( 2 ) * 1 / 273 * 4 / 5 * 6 * 7/8 

( 3 ) *1/2/3*4/5*6*7/8/9*10*11/12*13/14/15*16 ■ 
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On each line, the student copied the line above exactly, and then copied it 
again, changing x’s to v’s, v’s to x’s, and continuing the subscripts in order. 
The student noticed that the sum of the x-subscripts equals the sum of the 
^-subscripts in line (1). In line (2) the same equation holds, a similar one for 
the sums of the squares of the x- and y-subscripts, i.e., I 2 + 4 2 + 6 2 + 7 2 
= 2 2 + 3 2 + 5 2 + 8 3 . The student immediately made the inductive leap that 
in line (n), the sum of the Ath powers of the x-subscripts would equal the 
sum of the /cth powers of the y -subscripts for k = 1,2, . . ., n. Prove this for 
all n > 0 . 


Additional Examples 

1.3.4, 1.3.15, 1.11.4, 2.1.1, 2.1.2, 4.3.5, 4.3.13, 4.3.14, 4.3.15, 4.3.24, 5.4.8, 
6.8.3, 7.2.9, 7.3.8. Applications of the binomial theorem: 1.1.1 (Solution 4), 
1.1.2, 1.3.8, 1.6.6(b), 1.12.4, 3.5.8, 3.5.10, 3.5.11,3.5.12, 3.5.13, 4.2.13, 4.3.5, 
4.4.9, 5.1.2, 5.1.15, 5.2.13, 6.8.3, 7.1.5, 7.1.15. 


5.2. Geometric Series 


The geometric series arises naturally in many problems, and it is therefore 
imperative to know its sum: 


n 


2 

1 = 0 


1 - x '’* 1 

1 - X 


X ^ 1, 


» . ” - 1 _ Y "+J 

Yx‘= lim Y x' = Urn 1-r-Z - 

,-to 1-0 " _ * M 1 - * 


= W<1. 

1 - X 


5.2.1. For a positive integer n, find a formula for o(n), the sum of the 
divisors of n. 


Solution. Clearly a(l) = 1. If p is a prime, the only divisors are 1 and p, so 
°(P) = P+ 1 - 

If n is a power of a prime, say n — p m , the divisors are I, p,p 2 , . . ., p m , 
so a(p m ) = 1 +p + ■■■ +p m = 0-p m+1 )/V-p)- 

Suppose n = ab, where a and b are relatively prime integers, each larger 
than one. Suppose the divisors of a are Oi,« 2 , . . . ,a s and the divisors of b 

are b t ,b 2 . b s . Then the divisors of n are a f bj, i = 1,2, .. ., s, j = 1, 

2, ... ,t, and the sum of these is 

(a,A t + • - • + a x b t ) + ( a 2 b t + * * • + a 2 b t ) + ■ ■ + ( a t b l + ■ • ■ + a s b,). 
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or equivalently, 

(a, + a a + ■ ■ ■ + fl M )(*t + * 2 +■■■+*()• 

Thus, o(n) = a{a)a(b). 

Consider now an arbitrary positive integer n, and suppose its unique 
factorization is 


n=PVPi ■■■??■ 


From the preceding work, we find that 


o(n) = 


i 

1 ~P\ 


)( 


1 

} 


i - P r ] \ 

i -Pk y 


5.2.2. Let n = 2m, where nt is an odd integer greater than 1. Let 8 = e lm ^ n . 
Express (1 — 0) 1 explicitly as a polynomial in 8, 

Offi k + a*_ 1 8* ' + ■ ■ • + ct]8 + £ 2 q , 

with integer coefficients a t . 


Solution. Notice that 9 is an nth root of unity, and that 8 m = (e 2 ’"^ 2 '")”' 
= e T ' = — 1. Thus, 


1 + 9 + 8 2 + 


I - 8 m = 2 

1 - 8 1-8‘ 


(>) 


Also, since m is odd, we have 


1 _ 8 + #2- ... + 8 m ~ 


1 ~(~8f 


= 0 . 


( 2 ) 


Now, adding equations (1) and (2), we get 

2 + 20 2 + * • • +29 m ~ 1 = ~~ , 


or, equivalently, 

—^ = 1 +02 + 04 + ... + 8 m ~ 1 . 

1 - P 


5.23. Sum the finite series cos# + cos20 + • ■ • + cosnS. 


Solution. The series we wish to evaluate is the real part of the geometric 
series 


«» + «“+,»+ • +e ni8 . 
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ffi(lt+1) ,_ e » 

e*-l e' e -l 

_ e H»+i)e _ e n{t>) _ , r e n«+i}e _ e ni») 
e K\») _ e ~K\») 2i sin£0 

= 4^ ■ *, n \fcosfn + — cosiff) 

21 sin^ff L v v 2> 2 } 

+ i(sin(« 4* \ )9 - sin^ff)] 

_ 2Sj9 [(“"(" + 

+ /(cosjtf - cos(m + 

Equating real parts, it follows that 

cos0 + cos 29 + * • * + cosn# = —- Tsinfn + i'ltf — sin iff] 

2 sin ^ L 7 J 

= sin(n 4* i)fl ^ ^ 

2 sin ^9 2 

5.2.4. Prove that the fraction 

I X 3 x 5 • * X (2n - 1) 

2x4x6*-* X 2 n ’ 

when reduced to lowest terms, is of the form a/ 2 W where a is odd and 
w < In. 

Solution. We can write the fraction in the form 

( 2 "> ! _ i n,\ 

l lK n\n\ 2 2n \nf ‘ 

Now, ( 2 „ n ) is an integer, so the only question that remains is to show that 
w < In. The highest power of 2 in (2«)! is 



(see 3.3.10), and in n\ is 

liMSMf] + -. + I?l + -- 
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It follows that 


00 n- -n 00 rr -i 

-2„ + 22 i -2 ^ 


2 k U “t H 2 


and therefore 


cOfT^Ti ^nr ti °°rr 

V 2 k _ ’v n _= V |JL 
Zj *t- Zi 1 Zj 


* -, a 2 * jt n 2 ' 


*=oll 2' 




- 2 » + 2 2 I4l-"- 2 If 4 

• ■ ^ k-i II 2 


“1 H 2 

00 

"+£[*! 

< « + 2 «/2 A 

Jt = ] 


5.2.5. For x > 0, evaluate in closed form 


Solution. Write x in the form 

00 a 

* = I* ] + 2 > 

n= I z 

where a n is 0 or I, and where, if x has the form m/2" (m an odd integer), 
we take a k = 0 for all sufficiently large k. 

For each n, [ 2”x ] is even if and only if a„ is 0. It follows that for each 
n, (— 1 ) 11 2 " x ® = 1 - 2 a„. Therefore, 

■ » (- 1 )^ « 1 - 2 «„ 

2 j yn yn 

n- 1 * n- I ^ 

» 1 “a 

= V 1-2 V — 

Z-j <■)* 

„=] * n-1 z 

= 1 ~2{x- 1 * 1 ). 


5.2.6. Evaluate in closed form 


2u X P + 1 - 1 ’ 

p,9)-l 


1*1 > 1 , 
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where the sum extends over all positive integers p and q such that p and q 
are relatively prime. 


Solution. 


2 

(/>■?)- 


i 

xP+i - 1 


2 

(/>•?) = 1 


1 


xr + i 


( 1 - l/xP+v) 



“ t 

2 2 

H -I V 


1 

x n(p + q) 


\ 


As p, q, and n vary over the index set in this sum, the powers of 1 / x will 
vary over all possible ordered pairs of positive integers Since the 

series is absolutely convergent (l/|x| < 1), we can rearrange the terms of 
the series into the form 


00 

{/'.?)=] « = 1 * 


_J_ 

nip + q) 


= 2 2 


<=i j -i 


i 

x ,+ j 



-( 

-( 


1 

1 - 1 /* 




( 


x 

x - 1 


■r 


Problems 

5.2.7. Let n = 2 p ~ \2 P — 1), and suppose that 2^-1 is a prime number. 
Show that the sum of all (positive) divisors of n, not including n itself, is 
exactly n. (A number having this property is called a perfect number.) 

5.2.8. Sum the series 1 + 22 + 333 + * • • + n( 11 .. . 1). 

' h " 

5.2.9. Let E(n) denote the largest integer k such that 5* is an integral 

divisor of the product 1 1 2 2 3 3 Find a closed-form formula for £’(5 m ), 

m a positive integer. What happens as m-»oc? 

5.2.10. A sequence is defined by o, = 2 and a„ = 3a„_ l + 1. Find the sum 

& l + ^2 " * * + 
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5.2.11. Verify the following formulas: 


(a) 

(b) 


n 


2 sin(2A: — 1)0 = 

i-i 


sin 2 «0 

sin# 


2 sin 2 (2k — 1)0 = — jn 
/-] 


sin4«0 
4 sin 20 


5.1.12. 


(a) If one tosses a fair coin until a head first appears, what is the 
probability that this event occurs on an even-numbered toss? 

(b) The game of craps is played in the following manner: A player tosses a 
pair of dice. If the number is 2, 3, or 12 he loses immediately; if it is 7 
or 11, he wins immediately. If any other number is obtained on the first 
toss, then that number becomes the players “point” and he must keep 
tossing the dice until either he “makes his point” (that is, obtains the 
first number again), in which case he wins, or he obtains 7, in which 
case he loses. Find the probability of winning. 

5.2.13. If a, b and c are the roots of the equation jc 3 - x 2 - x — I = 0, 


(a) show that a, b and c are distinct; 

(b) show that 


^1000 _ £1000 
a — b 


£1000 __ c 1000 
b - c 


+ 


c 1000 _ d 1000 
c — a 


is an integer. 


5.2.14. 

(a) Prove n VnT < Y[p\np X ^ p V) > where the product on the right is over those 
positive primes p which divide «: (Hint: First prove that 


+ 

n 

+ r « i 


■ < n 

\p 2 \ 

ipi 

T 

P ~ 



(b) Use part (a) to prove there are an infinite number of primes. (Hint: 
First prove that (nl) 2 > «".) 

5.2.15. Prove that n*-o(l + « S“=o*" = 1/(1 - *), M < 1. 

5.2.16. Evaluate in closed form: 2T=o(* 2 "/( 1 - * 2 " + '))> -*| < 1 

5.2.17. (a) Let p { , p 2 , . .. ,p„ be all the primes less than m, and define 

-ip 

Show that A(m) *» ^(p^'pp • * • where the sum is over all n-tuples of 
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nonnegative integers (a i ,a 2 , . . ., a„). (Hint: 




(b) Show that 1 + I /2 + 1/3 + • • • 1 fm < A(m), and conclude that 
there are an infinite number of primes. 


Additional Examples 

1.12.1, 4.1.4, 4.1.8, 4.1.9, 4.2.5, 4.2.8, 4.2.12, 4.2.18, 4.3.13, 4.3.18(c), 5.1.4, 
5.1.11, 5.4.1, 5.4.7, 5.4.9, 7.6.6. 

5.3. Telescoping Series 


Infinite series and infinite products can sometimes be evaluated by means 
of “telescoping.” The examples are self-explanatory. 


5.3.1. Sum the infinite series 


2 
i- 1 


1 

(3f-2)(3/ + l) ‘ 


Solution. The trick is to break the summand into a sum of partial fractions, 
with the result that most of the terms in the partial sum will cancel. We 
look for numbers A and B such that 


1 = A B 

(3/ - 2)(3i + I) 3i — 2 + 3/ + 1 

This leads to 

l = A (3/ + 1) + B(3i — 2) 
and equating coefficients, we have 

3A + 35 = 0, 

A - 2B= 1. 

It follows that A = B = - j. Thus 


n 


5.-2 

i-l 


J 

3/-2 


i 

3 

3< + 1 


1 

3 





■ * + 


( 3« - 2 


1 

3« + I 


)]■ 
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In this sum we have the “telescoping” property: the second term in each 
pair cancels with the first term in the successive pair, with the result that 



1 

3 n 4- 1 


It follows that the infinite series adds to lim fl ^^ S„ = 3 . 


53.2. Sum the infinite series 


1x2x3 2x3x4 3x4x5 4x5x6 


Solution. Again, by partial fractions, we look for real numbers A, B, C such 
that 


2/i + 1 _ A_ + B C 

«(«+ l)(n + 2 ) « «+l « + 2 ' 


We get 

2/j + 1 = A (« + l)(n + 2) + Zf/i(/i + 2) + Cn(r 1 + 1). 

Setting n = 0 yields A =£; setting «= —1 yields B= 1; setting n = —2 
yields C= — \. Thus, the nth partial sum is 


1 , 1 n 

-1 + 1- 1 - 

+ 

i , 1 1 

A + 1 -± 

12 3 


2 3 4 


[1 + 1- 
3 4 

1 

2 

+ * * 

+ 

1 

1 i 1 

" 5 

« — 2 + n 

- 1 n 

r 1 

_ ^ 1 

1 _ 

3 1 

2 


* + 1 

$ ~\ 
1 

n - 1 

n 

n + 1 


n n + \ 

n + 2 


In this case, the telescoping takes place across groupings: the last term of 
one triple cancels with the sum of the middle term of the next grouping and 
the first term of the third grouping after that: 



= 0 . 


The resulting sum is therefore equal to 


[1 + 1 1 


r 1 1 


r „ 1 i 


[ 1 

1 1 

+ 

2 

_i_ 

T 


2 

1 + 2 

2 

T 

n + 1 

T 

[ n + l 

n + 2 


_ 5 _ T 7 

4 n + l n + 2 ' 

It follows that the infinite sum adds to lim n ^, 0 O 5„ = |. 
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5-5-5. Express 


as a rational number. 




Solution. Let S be the desired sum. Then, leaving out the details of the 
partial-fraction decomposition, we have 




i 


mn(m + n + 2 ) 


00 00 

-sis 


1 


-i 71 «.= I «(« + « + 2 ) 


00 - oo 

-Sis 

* - 1 " m - 1 


l/{« + 2) l/(n + 2) 


m 


m + n + 2 


„?,»(« +2) [(' n + 3 ) + ( 2 n + 4 ) 


1 + i + i + 


ft + 2 


= y . J L 

,-i «(* + 2 ) L 

= n |,(i“7T2)( 1 + i + “■ + -^h) 

= i[(!-iX 1 + i + 1) + (i-i)(l +i + i + i) 

+ (t - f)( 1 + 1+3 + 4 + 5) + * ‘ ’ ] 

-*[0 + i + i) + i0 + ± + * + i) + HS + *) 

+ i(i + i) + Hi + +)+---] 


-I[i! + 1.25 + 

2 6 2 12 




1 


+ ( —+ 

\ 3 x 5 4 x 6 


3 x 4 4 x 5 

l , 1 


5X6 


...) 


5^7 + ‘")} 

-}[v + s + ((W) + (i-o + ■■■) 

+ i((i-i) + (i-i) + (*-l)+---)] 

-i[¥ + fi + i + * + t] 

1 = M = 2 
J 48 4 • 


44 + 25 + 8 + 7 
24 
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53.4. Sum the series 


2 3-w(i). 

n = l v J ' 


Solution. Using de Moivre’s Theorem, 
sin 3# = Im(e 3 ' fl ) = lm((* ftf ) J ) 

= Im[cos# + i sin 0] 3 

= Im[cos 3 # + 3 cos 2 #; sin 8 + 3 cos# r sin 2 # + i 3 sin 3 #] 
= 3 cos 3 # sin # — sin 3 # 

= 3[(1 - sin 3 # )sin#] - sin 3 # 

= 3 sin# — 4 sin 3 #. 


It follows that 


sin 3 # = 4 sin # — t sin 3#. 

4 4 


Thus, 


3 , 

4 sin 

4 




S k = 2 3' , -'sin 3 (^) = ^ 

n~=\ \ 3 / n=} 

-(! sin !-j s H + (f sin (?h!“"(!)) 

+ (t s, "(#)-t s, "(?)) + '" 

+ (T si "(^)-T sin (5^r)) 

= 4- sinf - 4 sin*. 

4 V 3* f 4 


Therefore, the series adds to 


lim S k — lim 

k —>oo £--*oo 


= lim 

k-> co 




4 \ 3 * 

$in(x/3*) 
(jc/3*) 


* sin a: 


—^-sinx 

4 


x — sin x 


The telescoping idea is particularly useful in solving recurrence relations. 
Here is an example; other examples are given in the next section. 



174 


5. Summation of Series 


53.5. A sequence of numbers satisfies the recursion 

.v 0 = 0, nx n — (n — 2 )x n _ , + 1 for n > 0. 

Find a closed-form expression for x n . 

Solution. We see that x 0 = 0, jc, = 1, x 2 = l, x 2 = \, and consequently 
x n = ± for all n > 2. But finding a pattern for a given recursion is not 
always so simple, and it is instructive to consider the problem in the 
following manner. 

For n > 2, multiply each side of the recursion by n — 1, and for each n, 
set y n = /;(« — l)x n . The recursion in terms of then’s is 

y„ = y„_, + (n - 1), >*,=0. 

It follows that 

/2-Ti = 1, 

T3 -T2 = 2, 

T4-T3 = 3, 


y«-y «-1 = « - !■ 

Adding (notice the telescope), we get 


(« - l)w 

y n — 1 + 2 + ••• +(«—!) =-^— 


and therefore, 


x„ — 


n — 7 ’ 


n > 2. 


Problems 


53.6. Sum the following: 

/ a \ JL + .2 + JL + . . . + " ~ 1 

( ’ 2! + 3! 4! + n\ ’ 

(b) 1 X l! + 2 X 2!+ 3 X 3!+ ■ ■ ■ + n X «!, 

2_^ _4 ^ 6 


( c ) 


1x2x3 2x3x4 3x4x5 


+ ■ * ■ + 


2 « 


n(n + l)(n + 2 ) 


53.7. Evaluate the following infinite products: 

(a) 

(b) ri".,(« 3 -1)/(« 3 +1). 

(c) Show that an infinite product can be transformed into an infinite series 
by means of the identity P = e log/ \ Work part (a) in this way by 
evaluating the infinite series 2 “-|l°g 0 — 1 /n 2 )- 
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53.8. Prove that for each positive integer m, 

2m _ . 

_ m _ < V — <r _2?_ i _ 3m 4- 1 _ 

(m + l)(2m + l) r= “ + 1 a 1 (m + \)(2m + 1) 4m(m + l)(2m + 1) 

(Hint: Notice that !//■(/■ + 1) < 1/r 2 < I /(r + l)(r — 1).) 

53.9. Let F t , F 2 ,... be the Fibonacci sequence. Use the telescoping 
property to prove the following identities: 

(a) F t + F 2 + ■ ■ ■ + F„ = F n + 2 - 1. (Hint: F„_ 2 = F„ - F„_ t .) 

(b) F t + F 3 + ■■F 2n _ x = F ln . 

(c) Ff = Fj+ ■ ■ ■ Ft + F n F n + ] . (Hint: F 2 = F„(F n + , - F„_,) = 
f f — F F .) 

(d) (Show that 1 /F b „,F b + 1 = 1/F„_,F„- 1/ 
W+i-) 

(*) S*- 2 F n /F„_ | F B+1 = 1 . 

53.10. Sum the following infinite series: 

(a) sin 3 * + 4 sin 3 3x + sin 3 3 2 x + - * * 

v 3 3 2 

(b) cos 3 * — ^ cos 3 3 x + cos 3 3 2 x + - * * , 


53.11. 


(a) Use the identity (k + l ) 3 — k 2 = 3 k 2 + 3k + 1 to evaluate the sum of 
the first n squares. (Hint: Let k vary from 1 to n in the given identity, 
and consider the sum, on the left side and the right side, of the resulting 
n equations.) 

(b) Use the telescoping idea, as in part (a), to evaluate the sum of the first n 
cubes. 

(c) Find 


lim 

n —>oo 


i (5 A 4 - 18 A 2 At 2 + 5& 4 ) 

n h-\k ~i 


53.12. Show that the reciprocal of every integer greater than 1 is 
the sum of a finite number of consecutive terms of the infinite series 
2“.,i/«(« + i). 

53.13. If m > 1 is an integer and x is real, define 


Show that 


/(*) 


oo m — 


2 2 


' 

x + im k 



1 


, 

m 



(Hint: See 1.2.3.) 



[x] 

I*+ll 


if x > 0, 
if x < 0. 
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53.K Solve the following recurrence relations (by the methods of this 
section). 


(a) x 0 = l,x n = 2 ;c„_ l + 1 for n > 0. [Hint: Divide each side by 2 ".] 

(b) jc 0 = 0 , «x„ + (« + 2 )x„_ , + 1 for n > 0 . 

(c) * 0 = 1, = 1, = 2, x n+3 = x„ + 3 for « > 0. 

53.15. Show that a plane is divided by n straight lines, of which no two 
are parallel and no three meet in a point, into ((n 2 + n + 2 ) regions. 


53.16, Let 


Show that 


d„ = 


n + 1 n + 2 



d= 1 - 




+ 2n — 1 2« 


[Hint: Consider the telescoping series 2 "=i(4-h ~ 4)-l Conclude that 
d„~* log2 as n->oo. (For another proof of the first part, consider the 
difference of each expression from the harmonic sum I + 1 / 2 + ■ * + 

1/(2 (n - 1)). Also, see Section 6 . 8 .) 


Additional Examples 
6 . 6 . 6 , 7.1.8, 7.2.2. 

5.4. Power Series 


A power series is an expression of the type 

a 0 + a, jc + a 2 x 2 + ■ • * + a„x n + ■ • ■ 

where a 0 , a,, a 2 ,. . . are real numbers. 

Given a power series, we can define a function f(x ), whose domain is the 
set of those real numbers x which make the power series into a convergent 
infinite series, and whose value is given by 

/(c) = a 0 + a t c + a 2 c 2 + * • * + a„c" + * • * 

for any c such that the right side converges. 

Given a power series SZ-o^-v', it can be shown that exactly one of the 
following holds: 

(i) The series is convergent for all real numbers x. 

(ii) The series is convergent only for x = 0. 
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(iii) There exists a real number r such that the series is convergent for 
|x| < r and divergent for |x| > r. 


We define the radius of convergence to be + oo if (i) holds, 0 if (ii) holds, 
and r if (iii) holds. 

We can ask the following question: Given a function/, is/represented 
by a power series? One result along these lines is Taylor’s theorem (with 
remainder): If / can be differentiated as many times as we like on an 
interval [O.a], then 


/'( 0 ) 

/(*)=/(<>)+ :L ± i x + 


1 ! 


/"(°) 

2 ! 


x 3 + 


/ (n >(0) 


+ *„(■*), 


where R„(x) — f { " + ' ) (c)a n+ '/(« + I)! for some c, 0 < c < a. The impor¬ 
tant part here is that if R„(x) is well behaved, so that /?„(*)~»0 as «-»oo, 
then 


/(*) 


CO 


2 


rx o) 


«! 


This gives us a method of finding a power series for a given function f(x). 

Using this idea, one can find power-series expansions for the most 
common elementary functions. The following series occur so often they 
should be memorized: 


y.2 

«'-n-* + f r + f r +---+x- + ---. 0 


sinx = 

x 3 

Y — __U 

3! 

x 5 

51 

„ v 2 n — I 

■“ + <- ! > (2-J-1JI + ■■■- 

(iO 

cosx *= 

i- 2 

I — ~ + 
2 ! 

x d 

4! 

i ( n" x 2 " i 
{ > (2n)! 

(iii) 

1 = 

1 - X 

1 + x + x 

2 + . . 

• + x"+ * • • , |x| < I, 

(iv) 

log(l + X) = 

x 2 

^t + 

x 3 

3 

■•• + (-0 n f- 1*1 <1- 

<v) 


(1 + x) r = 1 + ( j )* + [j)* 1 + ‘ ' ‘ + + ‘ 1 ‘ > r real, 1*1 < 

(vi) 

where 

r(r~ l)(r- 2 ) • • ■ (r-n + 1 ) 
nl 


5.4.1. Prove that e is an irrational number. 
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Solution. Suppose e = h/k, where h and k are integers. Using the power- 
series expansion for e*, and setting x = 1 , we have 


^ =1 + ir + i! + “' + rr 


i 


i 


(A:+1)! (* + 2)! 

Multiply each side by k\, and write in the form 

1 


+ - - - 


k\ 




1 ! 2 ! 


k\ 


k + 1 (k + !)(& + 2 ) 


Notice that the right side of this equation is positive, and the left side is an 
integer. Thus, the left side is a positive integer. However, on the right side. 


<■ 


_l_ . . 

1 

(k + 1 )(£ + 2 ) 

1 

1 + * 

k + 1 

k + 2 

1 

1 + 1 

k + 1 

k+ 1 

1 

1 

k+ 1 

1 -/^ 


k + 1 V k } k 


Thus, the right side is not a positive integer, and we have a contradiction. It 
must therefore be the case that e is irrational. 


5.4.2. Show that the power-series representation for the infinite series 
— cannot have three consecutive zero coefficients. 

Solution. The series sums to f(x) = c 1 ' 1-1|! . To find its power-series repre¬ 
sentation, we need to compute / (n) (0) for « = 1,2,3,,. . . . We have 

j f(x) = e* (( - 1 ) 3 (3x I -4x+ 1), 

which is of the form 

J'(x)=f(x)g(x) 

where g(x) is a polynomial of degree 2. It follows that 

/"(*) =/'(*)£(*) + /(■*)£'{*)’ 

/'"(■*) =f M (x)g(x) + 2 f(x)g'(x) +/(x)g M (x), 

f' v \x) =f"(x)g(x) + + SfWg'V) 
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(note: g'"(x) = 0). An induction argument shows that for n = 3,4,5, . . . 

/"+'>(*) =/ n> (*)g(-*) + + b,f*- 2 \x)g”(x) 

for some integers a n and b n . 

Suppose that three successive terms in the power series for f(x) were 
zero, say f M (0) = / ( "“ '*(0) =/ (H_2> (0) = 0. Then, the recursion of the last 
paragraph implies that f <k) (0) = 0 for all k > «, and this means that f(x) is 
a polynomial, a contradiction. Therefore we are forced to conclude that the 
power series for f(x) cannot have three successive zero coefficients. 

5.43. Evaluate lim*_ m [(e/2)jr + * 2 [(1 + \/x) x — e]]. 


Solution. We will find the first few terms of the Taylor series for (1 + 
in powers of 1 /x. We have 


l fxf 


1 \ _ e Jdog(i + i A) = 


(' 4 )- 


exp< x 


(\/xf (l/xf 


( 1 /*)"^ + 


exp 






e * e 

= e 


= e 


■-i-i(i)-■■■ 

I_ i + ii(i) 2+higherpower50f 


It follows that 


lim 

x -*<x> 


%x + x 2 

[(l + ir-ell 

2 

\ X } J 


= lim 

x —>cc 

= lie 
24 ‘ 


4rr + higher powers of — 
24 x 




— e 


An extremely useful fact about power series is that they can be differen¬ 
tiated and integrated term by term in the interior of the interval of 
convergence. By this we mean that if has a radius of convergence r 
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and if f(x) = '£a„x n , then 

/'OO-S *’.*"- 1 and 

and both resulting series have radius of convergence r. 

One consequence of the preceding result is that the power-series repre¬ 
sentation of a function/ is unique; that is, if f(x) — ^a„x" = ^b n x”, then 
a„ = b n for all n. In fact, a„ = b n = f ln \0)/n\. To see this, simply differenti¬ 
ate each side of f(x) = '£ i a n x n repeatedly and evaluate each successive 
derivative at x = 0. For example, /(0) = a 0 ; f'(x)= ^na n x n ~ ] , so /'(0) 
= a t ; f"(x) = ^n(ti - \)a„x n ~ 2 , so /"(0) = 2!a 2 , or equivalently, a 2 
— /"(0)/2 !, and so forth. 


5.4,4. Sum the infinite series 



Solution, Begin with the series 


Multiply each side by x: 


= 2 


“n nl 


n —0 


1 ■ 




and differentiate each side: 


« In 4- ltx" 

0 +*)**=£ / 


n = 0 


Multiply each side by x again: 


, , 2 , , £ («+ 0 *' 


n«0 


and differentiate to get 


(1 + 3x + x 2 )e x = 2 


« (n 4- l) 2 x n 


n = 0 


nl 


Now set x = 1, and find that 


oo 


2 


(»+1 f 

«! 


= 5e. 


The following theorem is often useful. 
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Abel’s Limit Tbeorem. Let r > 0, and suppose converges. Then 

converges absolutely for | a‘| < r, and 

lim 2 a * x "= S a „r n . 

s~>r n-0 /7-0 


5.4.5. Sum the infinite series 


1 - i + i - -L + 
1 4 ’ 7 10 -T 


Solution. We know that 


and therefore 


— = 1 - x + - a: + - - * , |jc| < 1, 


P dx + ... I*. <x 

Jo l + x > 4 + 7 10 + ’ |JC|< L 

Now, the series on the right side converges for x = 1 (by the alternating- 
series test), and therefore, by Abel’s limit theorem, 

1-1 + 4-A+ ■ ■■ = jTtt4‘ 

*->l Jo | + X 

This integral can be worked by partial fractions (the details are not of 
interest here), and we get 


f' dx = i log 

Jo 1 + a 3 3 JVZ 


yl - a + x 2 

+ & arctan ^ x ~ * - arctan — 

L ■ & & \ 


Thus, the series sums to 


log2 + — . 

& \ 


5.4.6. Let S n = £"-i(-l)' + y' and S = \im„^ x S n . Show that 

sr-i<s.' s >-iog2-i- 


Solution, We must evaluate the double series 

OO 00 /_ i\« + ' 

2 2 —1 ■ 
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For this purpose, consider the function 

oo oo /_ +} 

n*)= 2 2 ~T*rr> W<»- 

j-l i = 1 l ^J 

Then 


It follows that 


n*)= 2 2 (-^) ,+y "(-») 

j- 1 '-i 

- 2(-i)(-*V2<-*)" 

/-i r=i 

-iv - —!— 


= 2<-i)(-*)'t 

/-i 1 


+ X 


“TT7 ^ {~ X Y 
1 + * >-e 


(1 + x) 


2 1 + X 


o + *r 


and we find 




dx 


(1 + ■*) 


F(x) - F(0) = log(l + *)]' + 


F(x) = log(l+x)+- r -L_-l. 

The series for F(x) is convergent for x = 1, so by Abel’s limit theorem, 
F(l) = )og2 + t-l = log2-). 


5.4.7. Given the power series a 0 + a { x + a 2 x 2 + • ■ ■ with a n = (n 2 + 1)3", 
show that there is a relation of the form 

a ,„ +P a „+1 + ?«„ + ! + ra „+3 = 

in which p,q,r are independent of n. Find these constants and the sum of 
the series. 

Solution. Substituting the given values of a n into the recurrence, we find 
(n 2 + 1)3" + p(n 2 + 2n + 2)3 n + l 

+ q(n 2 + 4 n + 5)3 " +2 + r(n 2 + 6 n+ 10)3 n + 3 = 0. 
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Now divide each side by 3". Then equating coefficients, we find that p, q, 
and r must satisfy 

3p + 9q + 27 r = — 1, 

2p + 12 q + 54 r = 0, 

6 p + 45 q + 270 r = — 1. 

These equations have a solution: p= — 1 , q = j, r = — 

For the second half, we wish to sum the series 

GO 

2 (n 2 + 1)3"*", 

n=0 

which breaks into two parts, 

00 00 
2 » 2 (3*) n + 2 (3*)". 

n—0 0 


Let S = 2n=oP-*) n - If |*| < j, S = 1/(1 — 3x). Therefore, from 


2 ( 3 *)"= -j—^7 ’ 1*1 

rt-0 1 


it follows that 


2«(3x) n -'-3 = 


/t-0 


(I -3x) 


2 ’ 


j] =0 (1 — 3x ) 


2« 2 (3-r l -3-t 


n-0 


3* 


(1-3*)' 


= 9* + 3 
(1-3 *) 2 


“L , „ 3*(3* + 1) 

2 » 2 <3*) - ---r 

«-0 (1 — 3jc ) 2 


Combining, we get 


“ „ 3x(3x + 1) . 

2 {n 1 + 1)(3*)" = V 2 ~ + j-V 
« = o (1 — 3*) 2 1-3* 


_ 18 jf 2 — 3x + 1 
(1-3 *) 2 


5.4,8, Evaluate in closed form: 
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Solution. We can compute the first few terms: 

MM)- 1 - 4 -- 3 ’ 

MoM^Mt)- 1 - 12416 - 5 ’ 

MMMJMS) 

= 1 -24+80-64= -7. 

From this pattern, we expect that S„ = (- l) n (2n + 1). 

If we think of proving this conjecture by mathematical induction, we are 
led to look for a recurrence relation. This leads to,the following reasoning: 

ln + k\_(n + k— l\ + /n + fc — 1 \ 

l 2k ) l 2k - 1 / l 2k } 

■[( n 2 + /-l 2 ) + ( n 2 + /-’, 2 )] + (' ,+ 2 r 1 ) 

-("iiV) + [(■ + u + u 2 )H” + u ■) 

= (n+k-2\.')(n + k~ l\_fn + fc-2\ 

l 2*-2 / \ 2k j l 2k }’ 


Thus, 


+ k- I 
2k 




*-£<- 4 >t*) 

+ 2 S <- 4 )*(" 

A-o v 

+ 2 s' <-<)*(" ■' + M - "s <-4)*(" -- 2 + M 

*-Q \ ZK f *=.0 V 

- -4"s‘(- 4)*("+ 2 *-') + 2 S„-,-S„- a 


rt—2 


2k ! 


k = 0 

= — I 2 *^«- 1 — S „-2 
= -2 S^ t -S„_ 2 . 
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Using this recurrence relation, we can use mathematical induction to 
establish the claim that S„ = (— l)"(2/i + 1). 

Consider the recurrence relation 

= — 2S„_, — S„_ 2 , S 0 = 1, S, = — 3, 

and, for the sake of the illustration, let us suppose that we are not able to 
discover the formula for S n by a consideration of the first few cases. We 
wish to give here a technique for making this discovery. The method is to 
make use of a generating function F(x), in the following way. 

Let F(x) be the name of the power series whose coefficients are S 0 , S lf 
S 2 , . . ., namely, 

F(*) = S 0 + $i x + S 2 x 2 + ■ ■ • + S n x " + ■ * * - 

We will act as though the series converges at to the function F(x). Then 

2xF(x) = 2S 0 x + 2 S { x 2 + 2S 2 x 3 + ■ ■ ■ + 2 S„_ l x n + ■ ■ • , 

x 2 F(x)= S 0 x 2 + s lX >+---+ $ n . 2 x n +---. 

Adding, and making use of the fact that S„ + 2S„_, + S„_ 2 = 0, we find 
that 

(I +2x + x 2 )F(x) = S 0 + (5, + 25 0 )jc, 

or, equivalently, 

F(x ) = 1 ~ * . 

(\ + x) 2 

We now express the right side of this equation as a power series. To do this, 
first differentiate each side of 


rb - 2 (->)'*” 

* ^ ■* n = 0 


to get 




(1 + n — 0 

Then multiply each side by x - 1 to get 


F(x)~ 'Z(-l) n n(x-l)x n - t 

n -0 

= 2 (- l )"**"- 2 (- 1 ) n « je '" 1 

n-0 n-0 

-2 (-»)'“"+2 (-»)>+ i )*" 


n-0 


7 — 0 


= 2 (-l)"(2«+ l)* n . 

n-0 

Here again, we find that S rt > the coefficient of is S n ■■ (- 1)"{2n + 1 ). 
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It is true that the genera ting-function approach shown here cannot be 
justified in a step-by-step manner, since we have completely disregarded 
convergence considerations. However, the method can nevertheless be used 
in similar problems to formulate conjectures (about the solution of the 
recurrence relation), and these conjectures can then be verified by other 
means (for example, mathematical induction). 


5.4.9. Sum the finite series a 0 + a x + • ■ • + a„, where u 0 = 2, a, =5, for 

« > 1, a n ~5«„_[ - 6 u„_2. 

Solution. The first few terms of the ^-sequence are 
2,5,13,35,97,275,393__ 

Here, a general formula for the nth term is not apparent, so we turn to the 
technique of generating functions. Consider 

F(x) = a 0 + + q-lX 1 + • • • + a n x" + • • ■ . 

We have 

-5xF(x) = -SuqJc -5 ayX 2 ~ ■ ■ ■ - 5a„^ ,x n - ■ • • , 

6 x 2 F(x) = 6 a 0 x 2 + • • • + 6 a„_ 2 x n + • • *. 

Adding and using the recurrence a n — 5 a n _, + 6 a „_ 2 = 0, we get 
(l - 5 jc + 6x 2 )F(x) = a Q + (u ( - 5u 0 )x, 

so that 


Fix') = --——- 

( > (1 — 2jc)(1 — 3jc) ‘ 

Write this as a sum of partial fractions, and make use of the geometric 
series, to find that 

F ( x ) = T ^ 2 x + T--"3jc 

-2(2 *)'+f (3*)' 

i-0 f-0 

- 2(2* + 3 V- 

i-0 

Thus, a t = 2' + 3' for i = 0,1,2,3, . .. . [As a check, we can verify this 
formula by induction. Note that u 0 = 2° + 3° = 2, a, = 2 + 3 = 5, and, for 
i > 2, a, = 5a,-j -6^ = 5(2'” 1 + 3'"" 1 ) - 6(2'"" 2 + 3'" 2 ) = 5(2' — 1 +3' _1 ) 
- 3 X 2 1 ' 1 - 2 X 3 , _ 1 » 2' + 3' .] 
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We are now ready to compute the sum: 


n n 


a 0 +a t + ... +<*„= S(2' + 3')= 2^+ 2 3' 

,-0 1=0 J =0 

= 2 JI+1 — 1 3 n + ' - 1 
2-1 3-1 

, , i "+1 _ i 

= 2" +1 - 1 + ————- 


= 2 n + 2 + 3” +l - 3 
2 

5.4.10. Find a closed-form expression for T n , if T 0 = 1 and for n > 1 

T n =T 0 T n _ 1 +T l T n -2+ ■■■ + r n _,r 0 . 


Solution. This recurrence relation arose in 3.5.12. To solve it, let 
f( x ) =T<>+ TiX + T 2 x 2 + ■ • • + 7> n + • ; • 

and set 

F(x) = xf(x) = Tqx + T t x 2 + T 2 x 3 + ■ • ■ + T n x n+1 + • • 
The reason for this step is that 

(F(x)) 2 -T*x* + {T 0 T 1 +T l T 0 )x 3 +.-- 

+ (ToT n _ l + T l T n _ 2 + ... + T„. 1 T 0 )x”*' + . 
so in view of the recurrence relation we have 

(F(x)) 1 =T l x 2 +T 2 x>+ ... + t kX ’+'+--- 
= F(x) - Tqx. 

Using the quadratic formula we find that 


F(x) = . 


1 -Vl -4x 


(We choose the negative sign because F(0) = 0; the positive sign would 
yield F(0) = 1.) 

Now by the power-series expansion, 


^4^ = 1 + ( j )(- 4x) + ( *)( -4xf 
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It follows that the coefficient of x" + 


T = 



in F(x) is 


1 

2 


(iHK-D-W 

(«+!)! 




1 . 1 X 3 X 5 X • • • X (2n - 1) _ (-1/ 

2 (n + 1)! 2" +1 ^ 


X Ix2x3x4x x2/i . 4" +1 

2 (n + l)!-2"n! 2 n + l 


1 . yny. 

n + 1 «!«! 


1 

n + 1 



In a manner analogous to the case for real numbers, we can introduce 
the notion of a complex valued power series 

CO 

2 4,*". 

n-* 0 

where the coefficients may be complex numbers and z a complex variable. 
The values of z for which this series converges defines a function 

/(*)“ 

rt =0 

It can be shown that the power series (i)-(vi) given for the elementary 
functions at the beginning of the section continue to hold when the real 
variable x is replaced with the complex variable z. 

A useful fact regarding complex power series is that if/(z) = 
then Re f(z) = 2® =0 Re(a„z fl ) and Im/(z) = 2“=oIm(a n z n ). 

As an example, we will justify the use of the formula e a = cos 9 + i sin# 
introduced in Section 3.5. We have 


Ree' e = Re £ 


{ wy 

n ! 


00 


2 Re 


(«)" 

n \ 


« (-1 ) k 9 lk 

So W 


= cos# 


and 


Ime' 




Jt-o 


n-0 


00 r- n*# 2 * +1 

2 - -sing. 


«! (2fc + l)I 


It follows that e‘ 9 = Ree* + i\me' 9 = cos# + / sin#. 
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5.4.11. Sum the infinite series 

I .3 

S = rcosd + y cos 2# + ycos30 + 0 < /■ < 1, 0 < 0 < it. 

Solution. Consider the infinite series 

-log(l -Z) = Z+ y + y + '- -+ ^- + -- ‘, \z\ < 1, 

and set z = r(cos0 + /sin0). Then 
— log( 1 — r cos 8 — ir sin 8 ) 

r 2 (cos 20 + i sin 25) 

= r(cos 0 + isin 0 ) H-y-- + • • • , 

and taking the real part of each side gives 

Re( - log( 1 - r cos 8 - ir sin 8 )) 

= rcos8 + y cos 28 + y cos 3 8 . 

Now, for a complex number w>, log m? = log|w[ + rargw. It follows that 

rcostf + y cos 20 + • • * = -log^/(l - rcos9f + (/-sinS ) 2 

= — logi/l — 2 rcos 0 + r 2 . 


Problems 


5.4.12. Let p and q be real numbers with l/p — \/q = 1, 0 < p < A. Show 
that 


P + iP* + jP 


2 - 1 - I n 3 + 


= q~{q 2 + tf 


5.4.13. Find the power-series expansions for each of the following: 


(a) 1 /(jc 2 + 5x + 6). 


(b) 


l + * 

(1 + x 2 )(l - JC) 2 


(c) a resin ,r. 

(d) arc tan x (use this to find a series of rational numbers which converges 
to it). 
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5 . 4 . 14 . Sum the following infinite series: 


* f — 4ir r ) 

W (2/1 + 1)! 

< b > 1 + 5 + W 


, r a nonzero integer. 

, 1x3x5 
3!3 3 


00 

(<*) 



(ir^dr^d) 4 -- 

l 2 + 2 2 _i . I 2 + 2 2 + 3 2 3 , l 2 + 2 2 + 3 2 + 4 2 
2! 3! 4! 


+ * • * . 


5 . 4 . 15 . Let/ 0 (;c) = e x and f n+ ,(x) = xj^(-x) for n = 0,1,2,, . . . . Show that 

£ /*0) _ 

n =o « ! 

(Hint: Consider g(x) = e e '.) 

5 . 4 . 16 . Prove that the value of the nth derivative of jc 3 /(jc 2 — 1) for x = 0 
is zero if n is even and — n! if « is odd and greater than 1. 

5 . 4 . 17 . Show that the functional equation 

is satisfied by 

/(*) = 1+ i* 2 + i** + 4* 6+ ‘ ‘ ‘ ■ 1*1 < L 

5 . 4 . 18 . Using power series, prove that sin(* + y) = sinxcos^ + cosx 
sin^. 

5 . 4 . 19 . Show that 



5 . 4 . 20 . Let B(n) be the number of ones in the base 2 expression for the 
positive integer «. For example, 5(6) = 5(110 2 ) = 2 and 5(15) = B( 111U) 
= 4. Determine whether or not 

» B(n) 

eX P 2 j , tv 

„_] «(« + 1 ) 


is a rational number. 
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5.4.21. For which real numbers a does the sequence defined by the initial 
condition w 0 = a and the recursion w n+1 = 2w„ — n 2 have t/ n > 0 for all 
n > 0? 

5.4.22. Prove that 

(|^) 3 = 1+6* + 18* 2 + * ■ * + (4n 2 + 2)*" + • • ■ , |*| < 1. 


5.4.23. Let T fl = £?=, (- 1) /+ '/(2i - 1), T=lim n ^ T„. Show that 


2(^-n = f 

n -I ° 


1 
4 ' 


5.4.24. Solve the recurrence relation a 0 = 1, a x = 0, a 2 = —5, and for n > 3 
a n = 4 a„_ | — 5a„_ 2 + 2u„_ 3 . 


5.4.25. Use the technique of generating functions to show that the nth 
Fibonacci number F n is equal to 


F = 



1-V5 

2 


n 


Vs . 


5 . 4 . 26 . Sum the finite series a 0 + u, + ■ • • + a„, where a 0 = 2, a t = 17, 

and for i > 1, u, = 7 a, — 12zr,_ 2 . 

5 . 4 . 27 . Show that the power series for the function e ax cosbx, a > 0, b > 0 
in powers of * has either no zero coefficients or infinitely many zero 
coefficients. 

5 . 4 . 28 . Sum the infinite series 

5=1— 2rcosfl + 3r 2 cos20 — 4r 3 cos0 + • • • , |r| < 1. 

5 . 4 . 29 . Show that 2;T=o( s * n nF)/n\ = sin(sin0)e cosS . 

5 . 430 . Use infinite series to evaluate lim^gjV* 3 - 5* 2 + 1 - *]. 


Additional Examples 

1.12.1, 5.3.16, 6.8.1, 7.6.7(c). Also, see Section 5.2 (Geometric Series) and 
Section 7.5 (Inequalities by Series). 



Chapter 6. Intermediate Real Analysis 


In this chapter we will review, by way of problems, the hierarchy of 
definitions and results concerning continuous, differentiable, and integrable 
functions. We will build on the reader’s understanding of limits to review 
the most important definitions (continuity in Section 6.1, differentiability in 
Section 6.3, and integrability in Section 6.8). We will also call attention to 
the most important properties of these classes of functions. It is useful to 
know, for example, that if a problem involves a continuous function, then 
we might be able to apply the intermediate-value theorem or the extreme- 
value theorem; or again, if the problem involves a differentiable function, 
we might expect to apply the mean-value theorem. Examples of these 
applications are included in this chapter, as well as applications of 
L’Hopital’s rule and the fundamental theorem of calculus. 

Throughout this chapter, R will denote the set of real numbers. 


6.1. Continuous Functions 


A real-valued function / is continuous at a if f(x) -» f(a) as x^a, or more 
precisely, if 

(i) f(a) is defined, 

(ii) lim x _, Q f(x) exists, and 

(iii) lim^ 0 /(x)=/(a). 

(H a is a boundary point in the domain of /, it is understood that the x’s in 
(ii) are restricted to the domain of/. We will assume the reader is familiar 
with these contingencies.) 
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A function / is continuous in a domain D if it is continuous at each 
point of D. 

It is not difficult to prove that / is continuous at a if and only if for every 
sequence {x n } converging to a, the sequence {/(x„)} converges to f(a). 

The sequential form for the definition of continuity of / is used most 
often when one wishes to show a function is discontinuous at a point. For 
example, the function / defined by 

,, , sin — 

/(*) - x 

lo 

is discontinuous at 0 because, for instance, the sequence x n = 2/(4 n + 1)tt 
converges to 0, whereas {/(-*„)} = {sin(2irn + ^ir)} converges to 1 (rather 
than to /(0) = 0). 

6 . 1 . 1 . Define /:[0,1]^[0,1] in the following manner: /(l) = 1, and if 
a = .Q l u 2 ^ 3 fl 4 ... is the decimal representation of a (written as a terminat¬ 
ing decimal if possible; e.g., .099999 .... is replaced by .1), define/(a) = 
.0a|0 a 2 0a 3 .... Discuss the continuity of /. 


if x ¥= 0, 
if x = 0 


Solution. Observe that/is a monotone increasing function. We will show it 
is discontinuous at each terminating decimal number (i.e., at each point of 
the form IV/IO", N an integer, 1 < JV < 10"). 

Consider, as an example, the point a = .413. By definition, f(a) = 
.040103. Now define a sequence x„ by 

x, = .4129, 

x 2 = .41299, 

x 3 = .412999, 


x„ =.412 999... 9 . 

n limes 

The sequence {x„} converges to a; however, 

f(x n ) =.040102 09 09 . .. 09 


n pairs 

and we see that {/(x„)} does not converge to f(a). Thus/ is not continuous 
at a. 

A similar construction can be made to show that f is discontinuous at 
each terminating decimal number. The argument is based on the fact that 
the terminating decimal numbers have two decimal representations, 
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namely, 

a =.a i a 2 a i . . . a n _^a n , a n ^0, 
a=. ai a 2 . . .a„_ t (a n - 1)999 .... 

Now suppose that a in (0,1) is not a terminating decimal number. We 
will show that/ is continuous at a. Write a in its unique decimal form: 

a = a t a 2 a 3 a 4 .... 

Because the number a is not a terminating decimal number, there are 
arbitrarily large integers n such that a n ¥=0 and a n+ , ^ 9. For each such n, 
define X„ and Y n by 

Y „ • • ■ a n ( a n +1 + 0 = x * + • 

Then a e (X „, F„). Moreover, the first n digits of each of the numbers in 
(A'n, Y n ) are the same as those of X„ and Y n . Consequently, all the numbers 
within (X n , Y n ) are mapped to the interval (f(X„), f(Y n )). 

It is clear that the sequences { X„} and { Y n ] converge to a\ furthermore, 
the sequences {'f(X n )} and {/( Y„)} converge to f(a). Since any sequence 
{ x n } which converges to a must eventually become interior to (A",,, Yj for 
any n, it must be the case that {/(*„)} converges to f(a). It follows that/ is 
continuous at a. 

The preceding example is difficult to visualize geometrically, and a 
thorough understanding of the proof requires a clear understanding of 
continuity. The next example also demands a precise rendering of the 
definition: a function f is continuous at a if for every e > 0 there is a 
number 8 > 0 such that |jc — a\ < S implies |/(x) — /(u)| < e. 


6.1.2. Suppose that f.R->R is a one-to-one continuous function with a 
fixed point .v 0 (that is, f(x 0 ) = x 0 ) such that /(2jc -/(*))= x for all x. 
Prove that f(x) = x. 


Solution. Let 5= {x|/(x) = x). Because / is continuous, the set 5 is a 
closed subset of R (i.e., if x n G S and jc„ x, then x Q S\ this is because 
x = lim n ^„x„ = lim n ^ M /(xJ = /(lim n ^ M * n ) =/(*).) 

Now suppose that S ^ R. Let x 0 be a boundary point of S (every 
neighborhood of x 0 contains points that are not in S\ note that x 0 6 S 
because S is closed). 

If y is a point that is not in S, there is a nonzero real number r such that 
/(_>■) = y + r. The fact that / is one-to-one and satisfies f(2x — f(x)) =* x 
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implies that 

f(y + nr) = (y + nr) + r 

for every integer n (this is the content of 2.1.12). This identity is crucial to 
the argument which follows. 

Here’s the idea: Suppose x is not in S; that is, f(x) y= x. Choose y in 
R — S so that y is “close” to x 0 and f(y) is “close” to y (this can be done 
because f is continuous at x 0 and f(x 0 ) = jc 0 ). Then, if r is such that 
/(>’) = / + r, and if r is sufficiently small, the fact that f(y + nr) = 
{y + nr) + r will lead to a contradiction to the continuity of / and x (see 
Figure 6.1). 

A formal proof goes as follows. Suppose, as above, that jc 0 is a boundary 
point of S and x is such that f(x) x. Let e = |/(jc) - xj. Because / is 
continuous at x , there is a 5 > 0, and we may assume that 5 < |r, such that 
\z - x| < 6 implies |/(z) -/(x)| <\e. Because / is continuous at x 0 , there 
is an 17 > 0, 17 < 5, such that |w — x 0 | < 17 implies |/(w) — /(jc 0 )| < 5. 

Now choose y e (jc 0 - 17 , x 0 + 17 ) such that f(y) y=y (such a y exists 
because x 0 is a boundary point of S). Then 

0 < I f(y)-y\ < 1/(20 -/(jc 0 )I +l/(*o) -y\ 

= 1/(7) -/(*o)l + 

< 6 + Tf 

<26. 

Let r = /(_>’) — y (note: r may be negative). Since 0 <M < 26, there is an 
integer n such that y + nr E (x — 8,x + 8). But we know that f(y + nr) 




196 


6. Intermediate Real Analysis 


= (y + nr) + r. It follows that 

* = l/(*) - *1 

<\f(x)~Ky+»r)\ + U(y+*r)-x\ 

+ |(y + nr) + r- x| 

<i« + l(^ + w)-Jc| + M 
<^e + S + 25 
<i^ + + §£ 

= e. 

This contradiction means that S = R and the solution is complete. 

Two of the most important facts about continuous functions over closed 
intervals [a, 6 ] are that they have maximum and minimum values on the 
interval and take on every value between these two. This is the content of 
the following two theorems. 

Extreme-Value Theorem. If f is a continuous function on [a,b\, then there are 
numbers c and d in [a,b\ such that /(c) < f(x) < fid) for all x in [a,,b\ (that is to 
say, fid) is the maximum value for f over [a, b], and /(c) is the minimum value). 

Intermediate-Value Theorem. If f is a continuous function on [a,b] and iff (a) < y 
<f(b) ( or,f(b)<y < /(<a)), then there is a number c in (a, b) such that /(c) = y. 

These results can be proved in a variety of ways; we will sketch a proof of 
the intermediate-value theorem which makes use of a methodology (re¬ 
peated bisection) that is applicable in other problems (e.g., see 6.3.6). 

Suppose that/ is a continuous function on the closed interval [a, h], and 
suppose that /(a) <f(b) (a similar proof can be given if f(a) > /(*))■ Let 
y e [/(a), We wish to find an element c in [a,b] such that /(c) = y. 
The procedure goes as follows (a diagram will help). Let a 0 = a, b 0 = b , and 
let x, denote the midpoint of the interval \a t b\ (the first bisection). If 
/(x,) < y, define a, = x,, b, = b, whereas if y < /(x,), define a, = a, h, 
= x r In either case we have/(«i) ^ y < /(£>,), and the length of [«),£>,] is 
one-half the length of [a, b\ 

Now, let x 2 denote the midpoint of [a,, 6,] (the second bisection). If 
/(x 2 ) <y, define a 2 = x 2 , b 2 = b lt and iiy < /(x 2 ), define a 2 = a lt b 2 ~ x 2 . 
Again, it follows that f(a 2 ) < y < f(b 2 ), and b 2 — a 2 = (b — a)/ 4. 

Continue in this way. The result will be an infinite nested sequence of 
closed intervals 

[a 0 ,6 0 ] D [ fl i A] =>[«2 A] D ‘ ’ ’ D ‘ ‘ ' 

whose lengths converge to zero (in fact, b, — a t = (b — a)/2‘). These condi¬ 
tions imply that {a,} and {£>,■} each converge to the same real number in 
[a,h]: call this number c. 
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By the continuity of/, lim^ /(«,) =/{c) and lim, WoD Fur¬ 

thermore, for each /, /(a/ < y < J(b,), and therefore (by the squeeze 
principle, which will be treated in Section 7.6), 

/(c) = lim /(a,) < y < lira f(b i )=f(c). 

/—> GO /—> 00 

It follows that /(c) = y, and the theorem is proved. 

The proof of the extreme-value theorem can be carried out in similar 
manner and is left as a problem (6.1.5). 


Problems 


6.13. Suppose that *f is bounded for a < x < b and, for every pair of 
values Xpjtj with a < x l < x 2 < b, 

f(H x i + *a)) <r(/(*i) +/(*i))- 

Prove that/ is continuous for a < x < b. [Hint: Show that f(x + 8) — f(x) 

< 'jlf(x + 2S)-/(jc)] < • • • < (l/2 n )[/(x + 2*8) — /(at)], a < x + 2 n 8 

< b. Let 5^0.] 

6.1.4. A real-valued continuous function satisfies for all real x and y the 
functional equation 

/(V* 2 +y 2 )=/(*)/(y). 

Prove that f(x) = [/(1)]* S . [Him: First prove the theorem for all numbers of 
the form 2"^ 2 where n is an interger. Then prove the theorem for all 
numbers of the form m/2 ", m an integer, n a nonnegative integer.] 

6.1.5. Use the method of repeated bisection to prove the extreme-value 
theorem. 


6.1.6. Let /(0)>0, /(1)<0. Prove that f(x) = 0 for some x under the 
assumption that there exists a continuous function g such that / + g is 
nondecreasing. (Hint: Use repeated bisection—choose the right half of the 
interval if there is a point x in it such that f(x) > 0 , otherwise choose the 
left half. This yields a nested sequence of intervals [a i ,b^\Z}[a 2 ,b 2 \~D ■ ■ ■ 
which converge to a point c. Note that for each n there is a point y„ in 
[a„,c] such that/(y„) > 0. Prove that/(c) = 0.) 

6.1.7. Let / be defined in the interval [0,1] by 



if x is irrational, 

if x = p/q (in lowest terms). 


(a) Prove that/ is discontinuous at each rational number in [0,1]. 

(b) Prove that/is continuous at each irrational number in [0.1]. 
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6.1.8. IF x is an element of the Cantor set K (see 3.4.6), it can be expressed 
uniquely in the form 



where b„ = 0 or 1. Define g : —> [0,1] by setting 

°° b 

T*- 

Now extend g to [0,1] in the following manner. If r£[0,l] is not in the 
Cantor set, then, using the notation of 3.4,6, there is a unique integer « such 
that x £ /„, where I„ = (X„,Y n ), X n and Y n in K. Define g(x) = g(T„). 
(Note that for all n, g(X n ) = g( Y n ), and thus we have simply made g 
constant on the closed interval [X n , TJ.) Prove that g is continuous. (Also, 
see 6.2.13.) 


Additional Examples 

6.3.1, 6.3.5, 6.3.6, 6.4.3, 6.7.2, 6.7.7, 6.8.9, 6.8.10, 6.9.5. Continuity is an 
underlying assumption in most of the examples in Chapter 6 ; in particular, 
see Section 6.2 (intermediate-value theorem). 


6.2. The Intermediate-Value Theorem 


The intermediate-value theorem states that if / is a continuous function on 
the closed interval [a, ft] and if d is between /(a) and /(ft), then there is a 
number c between a and b such that f(c) = d. The power of the theorem 
lies in the fact that it provides a way of knowing about the existence of 
something without requiring that it be explicitly found. 

As an example, let us show that -2x 5 + 4x = 1 has a solution in the 
interval (0,1). Consider f(x) = -2* 5 + 4x, and take two “pot-shots”: /(0) 
is too small, and /(l) is too large. Therefore, by the intermediate-value 
theorem, there is a number in (0,1) that is just right. 


6.2.1. A cross-country runner runs a six-mile course in 30 minutes. Prove 
that somewhere along the course the runner ran a mile in exactly 5 minutes. 


Solution. Let x denote the distance along the course, measured in miles 
from the starting line. For each x in [0,5], let f{x) denote the time that 
elapsed for the mile from the point jc to the point x + 1. The function / is 
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continuous. We are given that /(0) + /(l) + /(2) -I- /(3) 4- /(4) -l- /(5) = 30. 
It follows that not all of /(0), .... /(5) are smaller than 5, and similarly, not 
all of/(0), . .. ,/(5) are larger than 5. Therefore, there are points a and b in 
[0,5] such that f(a) < 5 < f(b). Thus, by the intermediate-value theorem, 
there is a number c between a and b such that f(c) = 5; that is to say, the 
mile from c to c + 1 was run in exactly 5 minutes. 


6.2.2. Suppose that/: [a ,&]-» R is a continuous function. 

(a) Mean value theorem for integrals. Prove that there is a number c in \a,b\ 
such that f(t)dt=f(c)(b - a). 

(b) Prove there is a number c in [a,b] such that j' a f(t)dt = f(l)dt. 
(Note: For this, it is enough to know that / is integrable over [a, 6].) 

Solution, (a) Let M and m be the maximum and minimum values of / on 
[a, b\ respectively (guaranteed to exist by the extreme-value theorem), and 
let A = j h a f(t)dt. The intuition for the argument which follows is shown 
(for the case of a positive function/) in Figure 6.2. As the liney = k moves 
continuously from y = m to y = M, the area A ( h ) in the rectangle bounded 
by y = A, y = 0, x = a, x = b moves from being smaller than A (at A (m)) 
to being greater than A (at A (A/)). Algebraically (and true independently 
of the “area” interpretation), A(m) = m(b - a) < f{t)di < M(b - a) 
= A (M). Since A(k) = h(b — a) is a continuous function of A, it follows 
from the intermediate-value theorem (note: A ( h ) = b(b — a) is a continu¬ 
ous function) that there is a point d such that A (d) = A ; or equivalently, 
d(b — a) = A. But d is between m and M, so again by the intermediate- 
value theorem, since/ is continuous, there is a point c in [ a,b] such that 
/(c) = d. It follows that 

f 6 /(r)^ = /(c)(f> - a). 

m/a 

(b) Again, the intuition is shown in Figure 6.3 (for the case of a positive 
function). Let A = f{t)dt, and let A (h) = f{t)dt. In the figure, for 
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a < A < b, A(h) represents the area bounded by y = /(.v), y =0, x = a, 
x — A (shaded). The problem asks us to find a point c such that A{c) = {A. 
It is clear that as the vertical line x = h moves to the right from x = a to 
x = b, the corresponding integral (area) will move from 0 to A, and must 
therefore pass through \A at some point. 

The preceding argument is perfectly valid provided we prove that A (A) is 
a continuous function of A. To see this, note that 

A(h + x)~A(h)= f h + X /(t)dt. 

From part (a), we know there is a point c x between A and h + x such that 

( h + x f(t)dl-c x \x\- 

Jh 

Therefore, 

lirn [A (h + x) - A (A)] = lim cj*| = 0. 

(Note: c x is bounded because f is integrable.) Thus, A (A + x)-^ A (A) as 
*^0, and this means that A (A) is continuous at A. 


6.23. Let A be a set of In points in the plane, no three of which are 
collinear. Suppose that n of them are colored red and the remaining n blue. 
Prove or disprove: there are n closed straight line segments, no two with a 
point in common, such that the endpoints of each segment are points of A 
having different colors. 


Solution. This problem was considered in 1.11.2, but it is instructive to see a 
proof based on the intermediate-value property. 

We shall prove that the result holds by induction on n. Certainly the 
property holds when n = 1. Suppose the result holds when « = 1,2,. . ., k. 
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and consider a set A with 2 (k + 1) points, no three of which are collinear, 
such that k + 1 are colored red and k + 1 are colored blue. 

Suppose that two vertices of the convex hull of A have different colors. 
Then, there are two consecutive vertices along the perimeter of the convex 
hull, say P and Q, that have different colors. By the inductive assumption, 
the set of points A — { P, Q) may be connected in the desired way. None 
,of these segments will intersect the line segment PQ because of the way P 
and Q were chosen, and therefore, the result holds for the set A. 

It remains to consider the case in which all the vertices of the convex 
hull have the same color, say red. If L is any nonhorizontal line in the 
plane, let B(L) denote the number of points of A to the left of L that are 
colored blue, let R(L) denote the number of points of A to the left of L that 
are colored red, and let D(L) = B(L) — R(L). Now choose a nonhorizontal 
line L that lies to the left of all the points in A and which is not parallel to 
any of the line segments that can be formed by joining points of A. In this 
position, D{L) = 0. As L moves continuously to the rght, it will encounter 
points of A one at a time, and in passing such a point, D(L) will change 
+ 1 if the point is colored blue and — 1 if the point is colored red. As L 
moves to the right, its first nonzero value will be negative (obtained just 
after passing the first point of A). Since the last-encountered point of A is 
also red, its last nonzero value will be positive (obtained just before passing 
the last point of A). 

It follows from these observations that D{L) will equal zero somewhere 
between the first and last encountered points of A (note that D(L) is an 
integer-valued function). When L is in such a position, the inductive 
assumption can be applied to the points to the left of L and also to the 
points to the right of L. Since none of the resulting segments will intersect, 
the result follows for the set A, and by induction, the proof is complete. 


Problems 

6.2.4. Suppose /: [0,1 ] —^[0,1] is continuous. Prove that there exists a 
number c in [0,1] such that /(c) = c. 

6.2.5. A rock climber starts to climb a mountain at 7:00 a.m. on Saturday 
and gets to the top at 5:00 p.m. He camps on top and climbs back down on 
Sunday, starting at 7:00 a.m. and getting back to his original starting point 
at 5:00 p.m. Show that at some time of day on Sunday he was at the same 
elevation as he was at that time on Saturday. 

6.2.6. Prove that a continuous function which takes on no value more than 
twice must take on some value exactly once. 
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6.2.7. Prove that the trigonometric polynomial 

+ a,cosjc + • • • + a n co$nx , 

where the coefficients are all real and |a 0 | + |a ,| + * • • + |a„_ ,| < a„, has 
at least 2 n zeros in the interval [ 0 , 2 ir). 

6.2.8. Establish necessary and sufficient conditions on the constant k for 
the existence of a continuous real-valued function f(x) satisfying /(/(■*)) 
= kx 9 for all real x. 

6.2.9. 

(a) Suppose that f\[a,b\->R is continuous and g:[a,b]^> R is integrable 
and such that g(x) > 0 for all x e \a, b\ Prove that there is a number c 
in [a, b\ such that 

f h f( x )g( x ) dx= f( c ) C f(x)dx. 

J G Ja 

(b) Suppose that /:[«,£>]-»R is increasing (and therefore integrable), and 
g‘.[a,b]^>R is integrable and such that g(x) > 0 for all x£[d,li], 
Prove that there is a number c in [a, b\ such that 

P/(*)*(*) r g(x)dx + f(b ) f b g{x)dx. 

Ja Ja Jc 

6.2.10. Let/: [0,!]-»/? be continuous and suppose that/(0) = /(l). Prove 
that for each positive integer « there is an x in [ 0,1 — l/«] such that 
/(-*)=/(*+ !/")• 

6.2.11. A polynomial P(x) of degree at most 3 describes the temperature 
of a certain body at time t. Show that the average temperature of the body 
between 9 a.m. and 3 p.m. can always be found by taking the average of the 
temperature at two fixed times, which are independent of which polynomial 
occurs. Also, show that these two times are 10:16 a.m. and 1:44 p.m. to the 
nearest minute. [Hint: Use the mean-value theorem for integrals; see 
6 . 2 . 2 (a).] 

6 . 2 . 12 . For any pair of triangles, prove that there exists a line which 
bisects them simultaneously. 

6.2.13. Give an example of a continuous real-valued function / from [0,1] 
to [ 0 , 1 ] which takes on every value in [ 0 , 1 ] an infinite number of times. 
[Hint: One way to do this is to modify the continuous function defined in 
6 . 1 . 8 .] 


Additional Examples 

6.1.6, 6.5.2, 6.5.3, 6.5.4, 6.5.13, 6.6.4, 6.6.5, 6.6.6, 6.6.9, 7.6.13. 
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6.3. The Derivative 


The derivative of /: [a, b\ - 


R at a point x in (a, b ) is defined by 
f(x+h)-f{x) 


f'(x) = ]im 
J v ; a-o ft 

provided this limit exists. We note that if f has a derivative at x, then f is 

continuous at x, because 


lim [ f(x + h) — f(x) 1 = lim 
h- k> >r->0 


= lim 
0 


/(* + *) ~/(*) 

| /(x + A)-/(x) j 



lim/i 

k-*Q 


- 0 . 


6.3.1. If the function x/(x) has a derivative at a given point x 0 ¥= 0, and iff 
is continuous there, show that f has a derivative there. 


Solution. Let 


L = lim 

x->xo 


xf(x) - x 0 /(x 0 ) 
X - X 0 


The limit on the right exists, since it represents the derivative of x/(x) at the 
point x 0 (in the definition of the derivative given above, we have substituted 
x - x 0 for k). For x sufficiently close to, but different from, x 0 (and 
therefore different from zero), 

xf(*) ' *o/(*o) 

/(*)-/(*<>) = _ *o 

x - x 0 x — x 0 

_ XXq /(*) - xx 0 f(x 0 ) 
xx 0 (x - x 0 ) 


xx 0 f(x) - x 2 /(x) - xx 0 /(x 0 ) + X 2 f(x) 
xx 0 (x - x 0 ) 

*/(*)(*0 - X) + *(*/(*) - X 0 /(X 0 )) 
XX^X - x 0 ) 


1 / xf(x) - x 0 /(x 0 ) \ f(x) 
x o\ X- x 0 ) x 0 
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It follows that 


/'<■%> = Jim 


/(*) “ f( x o) 


X — Xq 

1 ( *!(*) - *o/(*o) \ f(x) 


= lim 


1 / */(*)-*o/(*o) \ _ /W 
*0 \ * - *0 ) X0 

-± lim f X « X) - \ - x Um /( 

Xq *“**<>l X — Xq f X 0 X“>XO ' 


= f [W(* o)]- 

The fact that lim*.,* f(x) = f (x 0 ) follows from the hypothesis that / is 
continuous at x 0 . However, this assumption is not necessary, because 

/(*)-/(*<>) = xx of( x ) ~ xx of( x o) 

x-x 0 xx 0 (x - x 0 ) 

_ xx oft*) - - v o/(-*o) - **o/(*o) + xlf(x o) 
x*o(* - x„) 

1 | */(*) ~ *o/(*o) ^ f(* o) 


x - Xn 


Using this we find that 


/w . lim (MzZW) 

v ' x-txu 1 X — X Q f 

= lim ( I / ~ ^oA^o) \ _ /(*o) \ 

x ^ x — x 0 J x I 


= ^-[L-/(x 0 )]. 
*0 


63 . 2 . Let /(x) = a { sinx + a.,sin2x + • • • + a„sin«x, where a lt a 2 , , 
a n are real numbers and where n is a positive integer. Given that |/(x)| 
< |sinx| for all real x, prove that \a y + 2a 2 + • ■ ■ + na„\ < 1. 


Solution, We gave an induction proof of this problem in 2.4.4; however, a 
more natural approach is based on noticing that /'(x) = a t cosx + ■ • • + 
/irr„cosnx, from which we see that /'( 0 ) = a ] + 2a 2 + ■ • • + na n (which is 
the left side of the inequality we wish to prove). This prompts the following 
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argument: 


l/'(0)l - j™, 

/(*) 

X 

= lim 

/(*) 


X 

< lim 

sinx 

x-»0 

X 

= 1 , 



and this completes the proof. 


633. Let / be differentiable at x = a, and f(a) ¥= 0. Evaluate 

f(a + 1 /m) 


lim 


/(*) 


Solution. It suffices to evaluate 


lim 


/(g + x ) 


1 l/JV 


For x small enough, f{a + x) and f{a) have the same sign, and it follows 
that 


log 


f( a ) j \ l/( a )l / 


log|/(a + x)\ - log|/(a)| 
= lim - 


x^O X 

The last expression on the right is the definition of the derivative of 
log|/(.x)| at x = a, which we know from calculus is f'(a)/f(a). Thus, 

/(« + *) llA 


lim 

X ~»0 


/(«) 




Problems 

63 . 4 . 

(a) Suppose that instead of the usual definition of the derivative, which we 
will denote by Df(x), we define a new kind of derivative D*f(x) by the 
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formula 


£>*/(*) = lim 


f\x + h) - f\x) 
h 


Express £>*/(x) in terms of Df(x). 
(b) If / is differentiable at x, compute 


lim 

h^O 


f(x + ah) — /(x + bh) \ 

h ) 


(c) Suppose/is differentiable at x = 0 and satisfies the functional equation 
f(x +/) = f(x) + /(/) for all x and y. Prove that / is differentiable at 
every real number x. 


63.5. Define / by 


/(*) 


2 ■ 1 

x sin - 
x 


if x # 0, 
if x = 0. 


(a) Show that /'(x) exists for all x but that it is not continuous at x = 0. 
(The derivative for x ^ 0 is 2xsin(l/x) — cos(l/x); what is the deriva¬ 
tive at 0 ?) 

(b) Let g(x) = x + 2/(x). Show that g'(0) > 0 but that / is not monotonic 
in any open interval about 0 . 

63 . 6 . Let /: [0, l]-» R be a differentiable function. Assume there is no 
point x in [0,1] such that /(x) = 0 = /'(x). Show that / has only a finite 
number of zeros in [0,1]. [Suppose there are an infinite number. Either [0,£] 
or [£, 1 ] contains an infinite number of these zeros (perhaps both will). 
Choose one that does, and continue by repeated bisection. Along the way, 
construct a convergent sequence of distinct zeros. Use this to reach a 
contradiction.] 

63 . 7 . Prove that if / is differentiable on («,h) and has an extremum (that 
is, a maximum or minimum) at a point c in ( a,b ), then /'(c) = 0. [For 
applications of this result, see 6.4.1, 6.4.2, 6.4.5, 6.4.6, 6.4.7, 6.6.4, 7.4.1.] 


Additional Examples 
6.6.2, 6.7.2, 6.9.1, 7.6.2. 


6.4. The Extreme-Value Theorem 


An existence theorem is a theorem which states that something exists (for 
example, a point within the domain of a function which has some stated 
property). Quite often this special object occurs at some “extreme” position. 
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It is in this way that one comes to make use of the extreme-value theorem: 
If / is a continuous function over a closed interval [a,ft], there are points c 
and d in [a, ft] such that/(c) < f(x ) < /(d) for all x in [a, ft]. 


6.4.1. Suppose that f :[a,b]^>R is a differentiable function. Show that/' 
satisfies the conclusion of the intermediate-value theorem (i.e., if d is any 
number between /'(a) and /'(ft), then there is a number c in the interval 
(«,ft) such that f'(c) = d). 


Solution. If /' were continuous we could get the result by a direct applica¬ 
tion of the intermediate-value theorem (applied to/'). However,/' may not 
be continuous (for example, see 6.3.5(a)), so how are we to proceed? 

To help generate ideas, consider Figure 6.4. In this figure, a line L of 
slope d is drawn through the point, (a, /(a)), where /'(ft) < d < f'(a). For 
each point x in [a,b], let g(x) denote the signed distance from the point 
f(x) to the line L (the length of AB in the figure). Our intuition is that the 
point we seek is that point which maximizes the value of g. We shall show 
that this is indeed the case, but to simplify the computation we look at a 
slightly different function. 

For each x in [a,ft], let h(x) denote the signed distance of the vertical 
segment from the point (x, f(x)) to the line L (the length of BC in the 
figure). We observe that the point which maximizes the value of h on [a, ft] 
is the same as the point which maximizes the value of g on [a, ft]. (This is 
because g(x) = A(x)cosa, where a is the inclination of L.) The advantage 
of considering A(x) is that we can easily get an expression for it in terms of 
f(x) and the equation of L. 

So now return to the problem as stated, and consider the function 
h(x) -/(x) ~[/(a) + d(x - «)]. 
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We see that 


*'(*)" A*)-* 

Since f'(b) <d< /'(a), we have h'(b) < 0 < h'(a). These inequalities 
imply that neither h(a) nor b{b) is a maximum value for h on [a,b] (this is 
a consequence of the definition of the derivative). Therefore, since h is 
continuous on \a y b\ the extreme-value theorem says that h takes on a 
maximum value at some point c in (a, b). At this point, by 6.3.7, h'(c) = 0, 
which is to say, f'(c) = d. 

A similar argument can be made if/'(a) < d < f'(b). In this case, h takes 
on a minimum value at some point c in {a, b), and at this point, f r (c) = d. 


6 . 4 , 2 . P is an interior point of the angle whose sides are the rays 0,4 and 
OB, Locate X on OA and Y on OB so that the line segment X Y contains P 
and so that the product of distances ( PX){PY ) is a minimum. 


Solution. The situation is illustrated in Figure 6.5. 

The problem is typical of the “max-min” problems encountered in 
beginning calculus: it does not ask “Is there a minimum value?,” but 
rather, “Where does the minimum value occur?.” The technique is to apply 
the result of 6.3.7: if the minimum is in the interior of an open interval, it 
will occur at a point where the derivative is zero. Thus, we need to express 
( PX)(PY ) as a function of a single variable, and find where it has a zero 
derivative. 

For each positive number x, there is a unique point X on OA such that 
x = \OX |, and this point in turn determines a unique point Y on OB such 
that X, P, and Y are collinear. Thus, (/ > A')(/’F) is a function of x. How¬ 
ever, an explicit expression for this function is very messy; perhaps there is 
another way. 

Notice that ( PX){PY) is uniquely determined by the angle y (see Figure 
6.5). To obtain an explicit expression for (PX)(PY), first use the Law of 



Figure 6.5. 
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Sines in A OXP and A OPY to get 

sin« = siny sin ft = sin(^-~ a - ft - y) 

PX OP PY OP 

Then, it follows that 

F(V) -(PX^PY) - ( ) - (OP) ■ (- (j „ _ T) ) • (O^) 

= C(cscy)(csc (77 - « — ft — y)), 0 < y < it 

where C =■ sin« sin fi{OPf is a constant. 

The function F is continuous and differentiable on (0, 7 r), and F(y)—> cc 
as y-»0 + and as y and therefore F will take on a minimum value at 

a point in (0,w). At this point F'(y) = 0; that is 

0 = cscy CSC(tt - « — ft - y)[coty — Cot(77 - a - ft — y)]. 

Since neither cscy nor csc(tt - a — ft — y) equal zero on (0,77), the mini¬ 
mum occurs when coty = cot(77 - a - ft - y). But this happens for 
0 < y < 77 and 0<77 — a — ft — y<7r only when y = 77 — a — ft — y. 
Thus, the minimum occurs when A OXY is an isoceles triangle; that is, 
when OX = OY. (For another proof, see 8.1.3.) 


Problems 


6 A3. 


(a) Let/: [a,b ]-» R be continuous and such that f(x) > 0 for all x in [a y b\ 
Show that there is a positive constant c such that f(x) > c for all x in 
[a,b\ 

(b) Show that there is no continuous function / which maps the closed 
interval [ 0 , 1 ] onto the open interval ( 0 , 1 ). 


6.4.4. Let f:[a,b]~* R be differentiable at each point of [a,b\ and suppose 
that/'(a) = f'(b). Prove that there is at least one point c in (a,b) such that 


m 


H c )~f( a ) 

c — a 


6.4.5. 


(a) Rolle’s theorem. Suppose/: [a,b]-> R is continuous on [a,b] and differ¬ 
entiable on (a,b). If f(a) = f(b), then there is a number c in (a,b) such 
that f'(c) = 0 . 

(b) Mean value theorem. If f:[a,b}—>R is continuous on [ a , b] and differen¬ 
tiable on {a,b), then there is a number c in (a,b) such that 


/(*)-/(«) 


/'(«)■ 


b-a 
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6.4.6. If A, B, and C are the measures of the angles of a triangle, prove 
that 


— 2 < sin 3A + sin 3 B + sin 3 C < 11/3 , 
and determine when equality holds. 

6.4.7. Given a circle of radius r and a tangent line L to the circle through a 
point P on the circle. From a variable point R on the circle, a perpendicu¬ 
lar PQ is drawn to L with Q on L. Determine the maximum of the area of 
triangle PQR. 


Additional Examples 

1.11.5, 6.6.1, 6.6.4, 6.6.5. 

6.5. Rolle’s Theorem 


One of the fundamental properties of differentiable functions is the follow¬ 
ing existence theorem. 


Rolle’s Theorem. Suppose f: [a, f>]-» R is continuous on [a,b\ and differentiable on 

(a, b). If f(a) = f(b), then there is a number e in (a, b ) such that f'(c) = 0. 

This result is a direct consequence of 6.3.7: For let c be a point in (a, b) 
such that/(c) is an extremum (such a point c exists by the extreme value 
theorem). Then by 6.3.7, /'(c) = 0. Rolle’s theorem is important from a 
theoretical point of view (we shall subsequently show that the mean-value 
theorem and a host of useful corollaries are easy consequences of Rolle’s 
theorem), but it is also important as a problem-solving method. 

6.5.1. Show that 4 ax 2 + 3 bx 2 + 2cx = a + b + c has at least one root 
between 0 and 1. 

Solution. Any attempt to apply the intermediate-value theorem (as in the 
solution to a similar problem in Section 6.2) leads to complications, because 
not enough information is given regarding the values for a,b,c. But 
consider the function f(x) = ax* + bx 1 + cx 2 — ( a + b + c)x. Notice 
that /(0) = 0 = /(I)- Therefore, by Rolle’s theorem, there is a point d in 
(0,1) such that f\d ) = 0; that is to say, d is a root of 4ax 3 + 3 bx 2 + 2cx 
= a + b + c, and the solution is complete. 
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6.5.2. Prove that if the differentiable functions / and g satisfy f'(x)g(x) 
,=/= g'(x)f(x) for all x, then between any two roots of f(x) = 0 there is a root 
Of g(x) = 0. 


Solution. Let a and b be two roots of f, a < b. The condition implies that 
neither a nor b are roots of g(x) = 0. Suppose that g has no zeros between a 
and b. Then, as a consequence of the intermediate value theorem, the sign 
of g on [ a,b ] is always the same (that is, g(x) > 0 for all x in [o,b\, or 
g(x) < 0 for all x in [«,£>]). 

Now consider the function F(x) = f(x)/g(x). This function is continu¬ 
ous and differentiable on [a,b] and F(a) — 0 = F{b). Therefore, by Rolle’s 
theorem, there is a point c such that F’(c) = 0. But this leads to a 
contradiction, since 

„„ , S(<0/'</) - g'(c)f(c) 

F’(c) =- z -—— 

g 2 (c) 

and, by supposition, g(c)f'(c) — g'(c)f(c) # 0. This contradiction implies 
that g must have a zero between a and b, and the proof is complete. 

A useful corollary to Rolle’s theorem is that if / is a continuous and 
differentiable function, say on the interval [a,b\, and if x, and x 2 are zeros 
of/, a < jcj < x 2 < b, then/' has a zero between j; , and x 2 . More generally, 
if / has n distinct zeros in [a, b\, then /' has at least n — I zeros (these are 
interlaced with the zeros of /), f" has at least « — 2 zeros (assuming /' is 
continuous and differentiable on [a, h]), and so forth. 


6.53. Show that x 2 = jcsinx + cos jc for exactly two real values of x. 


Solution. Consider f(x) = x 2 — xsinx — cos*. Then f( — w/2)> o, m 
< 0, and/(w/2) > 0, so the intermediate-value theorem implies that/has at 
least two zeros. If /has three or more zeros, then, by the remarks preceding 
this example, /' has at least two zeros. However, 

/'(x) = 2x — sin jc — jccosx + sinx 

= x[2 — cosx], 

has only one zero. Therefore,/has exactly two zeros and the result follows. 


63.4. Let P(x) be a polynomial with real coefficients, and form the 
polynomial 

Q (*) = (x 2 + O/W'OO + *[(n*)) 2 + 



212 


6. Intermediate Real Analysis 


Given that the equation P(x) = 0 has n distinct real roots exceeding 1, 
prove or disprove that the equation Q(.x) — 0 has at least 2n — 1 distinct 
real roots. 


Solution. Let a, ,a 2 . be « distinct real roots of P(x) — 0, where 

1 < a, < a 2 < ■ • ■ < a„, and write 0 (x) in the form 

Q(x ) = (x - 1 ) 2 P(x)P'(x) + x[ P{x) + PX*)] 2 . 

Suppose that P(x) has no zeros in the open interval (« ; ,tf, + 1 ), i=1, 
2, ... ,n~ 1. (There is no loss of generality here, for if there are more, say 
m, m > «, relabel the a’s to include these, and the following proof will 
show that Q has at least 2m - \ distinct real roots.) By Rolle’s theorem, 
there is a point in (o ; ,u I + l ) such that P'(^i) = 0. Since P is a polynomial, 
P'(x) = 0 has only a finite number of roots in («,, a (+l ), so for each i, we 
may assume that b i is chosen as the largest zero of P‘ in + 

Suppose that P(x) is positive for all x in (see Figure 6 . 6 ), and 

consider the function F(x) = P(x) + P'(x). Our idea is to find a point c, in 
(h,,a 1 + 1 ) where F(c,) < 0. Then, since F(p,)> 0, the intermediate-value 
theorem would imply that there is a point d, in {b ; ,c t ) such that F{d,) = 0 , 
and consequently, 

2(6,)=6,(m)) 2 ><x 

0(4) = (4- i)V(4)H4)<o 

(note that P'(x) < 0 for all * in (b,, a i + ,)), and 

0(u i+l ) = ^ +1 (F(a ; + ! )) 2 >O- 

Therefore, by the intermediate-value theorem, there are points x t in (h,,4) 
and 7 , in (d ; ,u ) + 1 ] such that 0 (x,) = O = 0 (/ ; ). 


P(x) 



Figure 6.6. 
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For the preceding argument to work we must show there exists a point c ; 
in (bj, <*,+ [) where F(c t ) < 0. If a j+ , is a root of multiplicity one, then 
F(a i+t )=P'(a i + l ) < 0, and the desired c, can be found in a sufficiently 
small neighborhood of a i+l . If a I + l is a root of multiplicity greater than 
one, then P(a j+ [) = 0 = P'( a ;+ ,) and there is an interval (a i+ , - S,a i+ [) for 
sufficiently small S > 0, where / > "(.v) > 0 (see Figure 6.7), For such an x, it 
is the case that 


„„ , , P (x) 

P'(x) <-=- 

V ' x ~ a i + 1 *-«; + ! 


and therefore. 


F(x) = P(x) + P'(x) 


<P(X) 
= P(X) 


1 + 


x — a, 


x - a i+l + 1 


X — a. 


‘i+l 


Therefore, let c, = x, where x is chosen sufficiently close to o, + , so that 
the numerator of this last expression is positive and the denominator is 
negative. Then, for such a c ; , F(c t ) < 0, b- < c, < a i+l . This completes the 
argument: Q(x) = 0 has two roots in (b t ,a l+ ,]. 

The preceding argument was based on the assumption that Ffx:) >0 for 
x in (a,-,fl i + l ). For the case in which P(x) < 0 for all x in + ,), an 
exactly analogous argument leads to the same conclusion. Thus, we have 
shown that Q has at least 2n —2 zeros (two in each of the intervals 
(a;,tf / + 1 ), i = 1,2, . . . , n — 1). The solution will be complete if we can show 
Q has a zero in { —oo,^). Again there are several cases to consider. 

Suppose that P'(x) = 0 has a root in the interval (0,a f ). Then, without 
going through the details again, the same arguments show that Q has a zero 
in ( b 0 ,cti ), where b 0 is chosen as the largest zero of P' in (0,«,). 

We are left to consider what happens if P'(x) = 0 does not have a zero 
in (0, fl[). If /’(x) > 0 for all x in (O,^,), then P’(x) < 0 for all x in (0,a,) 
and therefore Q(0) < 0 and Q(a |) > 0. By the intermediate-value theorem. 
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Q(x ) = 0 has a root in (0,^,). Similarly, if P(a) < 0 for all x in (0, a,), we 
get g(0) > 0 and 2( a i) < 0, etc. Thus, in all cases 2 (a) = 0 has at least 
2n — 1 distinct roots. 

The preceding analysis, though tedious and complicated, was based 
entirely on first principles: Rolle’s theorem and the intermediate-value 
theorem. With these two ideas the conceptional aspects of the proof are 
quite natural and easy to understand. There is another solution which is 
much easier going, after a clever, but not uncommon, key step (e.g., see 
6.5.11 and 6.9.4). Since it is instructive, we will consider it also. 

First, notice that Q can be written as a product in the following manner: 

2(A) = (a 2 + l)P(x)P'(x) + x[(P(a)) 2 + (P'(x)) 2 ] 

= [P'(x) + xP(x)J£xP’(x) + P(x)]. 

Let P(x) = P'(x) + xP(x) and G(a) = xP'(x) + P(a). The key step, as we 
shall see, depends on noticing that F(a) = e~* ^ 2 [e J£ ^ 2 P(a)]' and <7 (a) 
= [aP(a)]'. 

Assume that P(a) has exactly m distinct real zeros a t exceeding I, with 
1 < a t < a 2 < • ■ • < a m (m > n). Then e* ^ 2 P(a) also has zeros at a,, 
a 2i . . ., a m , so by Rolle’s theorem, [e* ^ 2 P(a)]', and hence also F(x), has 
at least m — 1 zeros b i with u, <b,<a i+ Similarly, by Rolle’s theorem, 
G(x) has at least m zeros, c 0 ,c l , .. ., c m _ ] , 0 < c 0 < a t , a t < c ; < a I + l , 
/ = 1,2, . . . , m — I. We will be done if we can show that b i # c t for 
/ = 1 , . .. , m - 1 . 

So, assume that for some i, b j = c ( , and let r be this common value. 
From F(r) = 0, we find that P’(r) = - rP{r). Substituting this into G(r) 
= 0, we get r[ - rP(r)\ + P(r) = 0, or equivalently, (r 2 — 1 )P(r) = 0. Since 
r > 1, the last equation implies P{r) = 0. But since a t , < r < a i+l , we then 
have a contradiction to our assumption concerning the roots of P(x) = 0 
(namely, a, and a I+1 were assumed to be consecutive roots of P; i.e., all the 
roots of P exceeding I were included among the n ( .’s). It follows that the b,' s 
and the c,’s are different, and therefore Q(x) = 0 has at least 2m — 1 
(> 2n — 1) distinct real roots. 


Problems 


6.5.5. 


(a) Show that 5 a 4 — 4a + 1 has a root between 0 and 1. 

(b) If a 0 ,a l , ...,«„ are real numbers satisfying 



m+ 1 


= 0 , 


show that the equation a 0 + a^x + ■ • - + a H x" = 0 has at least one real 
root. 
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6.5.6. 


(a) Suppose that /: [0,1]-* R is differentiable, /(0) = 0, and f(x) > 0 for x 
in (0,1). Prove there is a number c in (0,1) such that 

2 W _ fV~ c ) 

m /( 1 - 

(Hint: Consider f\x)f( 1 - x).) 

(b) Is there a number d in (0,1) such that 


3/'(*Q fV~ d ) n 
f(d) /(l - d) ' 


6.5.7. 

(a) Cauchy mean-value theorem. If f and g are continuous on \a,b] and 
differentiable on {a, b), then there is a number c in (a, b) such that 

[f( b ) “/(«)] S'( c ) = [ g( b ) ~ «(«)]/( c > 

(b) Show that the mean-value theorem (6.4.5(b)) is a special case of 
part (a). 

6.5.8. 

(a) Show that x 3 — 3x + b cannot have more than one zero in [—1,1], 
regardless of the value of b. 

(b) Let f(x) = (x 2 — l)e e *. Show that /'(■*) = 0 for exactly one x in the 
interval (— 1,1) and that this x has the same sign as the parameter c. 

6.5.9. How many zeros does the function f(x) = 2 X — 1 — x 1 have on the 
real line? 

6.5.10. Let/(*) = a 0 + a y x + ■ • • + a n x n be a polynomial with real coef¬ 
ficients such that / has n + 1 distinct real zeros. Use Rolle’s theorem to 
show that a k — 0 for 0 < k < «. 

6.5.11. If f : R^>R is a differentiable function, prove there is a root of 
f'(x) — af(x) = 0 between any two roots of f(x) = 0. 

6.5.12. Suppose n is a nonnegative integer and 

/(*) = c 0 e r ^ + Cl e r ' x + ■ • • + c„e'- x , 

where c i and r are real numbers. Prove that if / has more than n zeros in R, 
then/(x:) = 0. (Hint: Induct on n.) 

6.5.13. The nth Legendre polynomial is defined by 

where D " denotes the nth derivative with respect to x. Prove that P n (jc) has 
exactly n distinct real roots and that they He in the interval (—1,1). (Hint: 
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(x 2 — 1)" = (a — l/fx + l)' 1 . Show, by an inductive argument, that the A'th 
derivative of (x - l) fl (x + 1)" has 1 as a zero of multiplicity n — k, — 1 as a 
zero of multiplicity n — k, and at least k distinct zeros between - I and 1.) 


6.6. The Mean-Value Theorem 


Suppose that f:[a,b]->R is continuous on [a, b\ and differentiable on 
(a, 6). In a manner similar to that used in the solution to 6.4.1, consider the 
function 

F(x)~f(x)- L(x), 

(see Figure 6.8), where y = L{x) is the equation of the line from ( a , f(a)) to 
(b, f(b)). Geometrically, F(x) represents the signed distance along the 
vertical line segment from (x,/(x)) to the line y = Z.(x). Since F(a) = 0 
= F(b), we know from Rolle’s theorem that there is a point c in (a, b) such 
that F'(c) = 0. At that point,/'(c) - L'( c) = 0, or equivalently, 

f(b) -f(a) 

f'(c) = L'(c) = (slope of L) - —— . 

We have just proved the following. 


Mean-Value Theorem. Iff: [a, 6]-» R is continuous on [a, 6] and differentiable on 
(a, b), then there is a number c in (a, b) such that 


m-f{a) 

b ~ a 


/'(c). 


If /(a) = f(b), this is just the statement of Rolle’s theorem. Otherwise, it 
says that there is a point between a and b where the slope of the curve is 
equal to the slope of the line through {a, f(a)) and ( b, f(b)). 



Figure 6.8. 
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6.6.1. Let g(x) be a function that has a continuous first derivative #'(*) for 
all values of x. Suppose that the following conditions hold: 

(i) g(0) = 0, 

00 lg'(*)l < IsOOl for all jc. 

Prove that g(x) vanishes identically. 


Solution. We will give a rather unusual solution, simply to illustrate the use 
of the mean-value theorem. Begin by considering the interval [0,1]. Let x 
be an arbitrary point in (0,1]. By the mean-value theorem, there is a point 
C[ in (0, x) such that 




g(*) - g(°) 

x — 0 


It follows that |g(x)| = |xg'(<h)l = \x\ |g'(q)l < 1*1 lg( c i)|- 

Similarly, there is a point c 2 in (0,c,) such that | glcfl < |c,| |g(c 2 )|, and 
substituting this into the last inequality, |g(x)| < |x| |C[| g(c 2 )\. 

Continuing in this way, we are able to find numbers c t , c 2 , . ■ ■ , c„, 
0 < c„ < ■ ■ • < c 2 < C[ < x < !, such that |g(x)| < • * • 

|c„- 1 | | g(c„)\. Since g is continuous on [0,1], it is bounded (between its 
minimum and maximum values, which exist by the extreme-value theorem), 
and therefore, since the right side of this last inequality can be made 
arbitrarily small by taking sufficiently large n (each of the |c,|’s is less than 
1), it must be the case that g(x) = 0. Thus, g(x) is identically equal to zero 
on [0,1], 

The same argument can now be applied to the interval [1,2] (for a in 
(1,2) there is a c ( in (1 ,jc) such that |g(x)| < |* — 1||g(c t )|, etc.). As a 
consequence of this argument, we will get g(jc) identically zero on [1,2]. 

By an inductive argument, we will get g equal to zero on [n, n + I ] for all 
integers n. Therefore, g is identically zero. (Notice that we did not use the 
hypothesis that g' was continuous.) 


The mean-value theorem has a number of important corollaries which 
are useful in practice. Among these are the following. 


Suppose f and g are continuous on [«, b] and differentiable on ( a , b). 

(i) Iff'(x) = 0 far all x in ( a,b ), then f is a constant. 

(ii) If f'(x) — g’(x) for all x in ( a , b), then there is a constant C such that 
/(*) - g(x) + C. 

(iii) If f'(x) > 0 for all x in (a,b), then f is an increasing function. Similarly, 
iff(x) < 0 if'(x) > 0, ffx) < 0) for all x in ( a,b ) then f is decreasing 
(nondecreasing, nonincreasing, respectively ) on ( a,b ). [For applications, 
see Section 7.4.] 
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Proof of (i): Let x^(a,b). By the mean-value theorem, there is a 
number c in (a,*) such that [/(x) — /(«)]/[x — q] = /'(c) = 0. It follows 
that /(x) = f(a) for all x in (a, A). 

Proof of (ii); Apply (i) to the function A (a) = /(x) — g(x). 

Proof of (iii): Consider x, y e (a,b), x <y. By the mean-value theorem 
there is a number c in (x, _y) such that ( f(y) -/(x)]/[y - x] = /'(c) > 0, 
from which it follows that /( >’) >/(x), and / is increasing. 


6.6.2. Let /: R -*• R be such that for all x and v in R, |/(x) - /( v)| < 
(x — >’)\ Prove that / is a constant. 


Solution. By the first of the preceding corollaries, it suffices to show that 
/'(x) = 0 for all x. Therefore, we argue as follows; 

f(y)~f( x ) 


!/'(*)! = 


Hm 

7 “ 


= Hm 

y -+ V 


= lim 


y~* 

y-x 

\f(y)~f(x)\ 
\y “ x| 


... O'-*) 

< hm -- 

y~* x \y ~ x\ 

= limjj-xl 

= 0 . 


6.63. Suppose that f:R~rR is twice differentiable with /"(x) > 0 for all 
x. Prove that for all a and b,a < b. 


/(*¥)< 


/(«)+/(*) 


Solution. Figure 6.9 makes the conclusion believable, but it is the mean- 
value theorem that enables us to translate the local property, /"(x) > 0 
(/" at x is determined by those values of / close to x), into a global 
property (true for all a and b regardless of their proximity). 

By the mean-value theorem there is a number x, in (a,^(a + b)) such 
that 

f^(a + b))-f(a) 

= /(*!)> 


^(a + 6) - a 
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and a number x 2 in (4(a + b), b) such that 
f{b)~f{{{a + b)) 
b — £(a + b) 


A x 2 ). 


But f"(x) > 0 for all .* in (x h x 2 ), so/' is a nondecreasing function. Thus 
f(x 2 ) ? f'( x 1 )’ or equivalently, 

/(*)-/(*(« + *)) ^ /(i(«+ *))-/(«) 

b-a > b-a 


/(*(« + *))< 


m+m 

2 


In the remainder of the section we will consider problems which make 
use of all of the major existence theorems considered in this chapter: the 
intermediate-value theorem, the extreme-value theorem, Rolle's theorem, 
and the mean-value theorem. 


6,6.4, Let/be differentiable with / continuous on [a,b]. Show that there is 
a number c in ( a,b] such that /'(c) = 0, then we can find a number £ in 
(a, b) such that 


/'(£) = 


m -/(«) 

b — a 


Solution, We begin by getting a geometrical feel for the problem: consider 
the graph in Figure 6.10, where B is located so that the line CB is 
horizontal. For a point x between a and b, the right side of the equation, 
i.e., 

/w -m 

b-a ’ 
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represents the slope of the line AB, whereas the left side, f'(x), represents 
the slope of the tangent to the curve at C. 

Consider, then, the function 


^(*) = /'(*)- 


/(*)“/<«) 

b~a 


This is a continuous function of x (here we use the fact that /' is 
continuous), so by the intermediate-value theorem, there is a point £ in 
( a,b ) such that F(£) = 0 provided we can find points jcj and x 2 in (a,b) 
such that F(jc ( ) > 0 and F(_x 2 ) < 0. 

Observe that F(x) moves from being positive at a = a to being negative 
at x = c. Will this, or something similar, always be the case? 

Suppose that /(c) > f(a). Then /'(c) = 0, and [/(c) - }{a)\/[b - a] > 0, 
so that 


F(c) = /'(c)~ 


/(/)-/(<*) 

b — a 


< 0 . 


By the mean-value theorem, there is a point d in (0,c) such that 
f(d) = 1/(0 ~ /(a)]/[c - a]- Therefore, 


F{d)=f\d)~ 


f{d )-/<«) 

b-a 


1(d)-f{a) 

c ~ a b — a 


^ /(0-/H 

> b- a 

_ m-ftd) 

b — a 


f(d)-m 


b-a 
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Now, we would be done if it were the case that /(c) > f(d). Unfortunately, 
this may not be true, as the graph in Figure 6.11 indicates. 

To alleviate this difficulty, we can proceed as follows. Consider the 
function / over the interval [a, c]. By the extreme-value theorem, it attains a 
maximum value on this interval, say at x — s (s may equal c). Since we are 
assuming that /(c) > /(a), we know that a < s < c. If s = c then f'{s) 
= /'(c) = 0, whereas if a < s < c then f(s) = 0 by 6.3.7. Now proceed as 
before: There is a point d in (a,s) such that /'(d) = [ f(s) — /(«)]/ 
[s - a], and 


F(d)=f'(d)- 


m-m 

b - a 


f(s)-f(a) _ f(d) - f(a ) 
s — a b — a 


b — a b — a 


f( s )~f(d) 

b — a 


and this last expression is nonnegative, since f(s) > f(d) by our choice of s. 
This completes the proof for this case. The argument is similar for the cases 
f(c)<f(a) and/(c) =/(«). 


6.6.5. Suppose / is a twice continuously differentiable real-valued function 
defined for all real numbers such that |/(x)| < 1 for all x and (/(O)) 2 + 
(/TO)) 2 = 4. Prove that there exists a real number such that f(x 0 ) + 
/'Vo) = 0. 


Solution. There are two natural approaches we might consider. One is to try 
to apply the intermediate-value theorem: that is, to consider the function 
F(x) = /(jc) + f"(x) and to find a and b for which F(a) > 0 and F(b) < 0. 
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Unfortunately, it is hard to see how the condition (/(0)) 2 + (/"(0)) 2 = 4 
could be used in this approach. 

Another idea is to see if G(x) = ( f(x)) 2 + (f(x)) 1 has an extremum in 
the interior of some interval. At such an extremum, G'(*) = 0. Notice that 
G , (x) = 2f(x)f'(x) + 2f'{x)f"(x) = 2f'(x)[f(x) +f"(x)]. This looks more 
like it! 

Our approach will be to show that there are points a and b, — 2 < a < 0, 
0 < b < 2, such that |G(u)| < 2 and |G(f>)l < 2. Since G(0) = 4, it will 
follow that G(x) attains its maximum at a point jc 0 in (a, b), and at this 
point, G'{*o) = 0. 

From the mean-value theorem there is a point a in (— 2,0) and b in (0,2) 
such that 


f\ a ) = 

It follows that 


/( 0 ) /( 2 ) 


and /'(£) = 


/( 2 )~/(0) 


\m\ = 

\m\ = 


2 

/( 2 ) - ~ 0 ) 
2 


l/(0)l + l /(- 2 )l _ 

2 < —- 

l/(2)l + l/(°)l . 1 + | _ , 
i ■ 


Thus, 


lG ( a )| = |(/(^)) 2 + (/'( a )) 2 1 < |/(*) l 2 + |/'(«)| 2 < 2 , 

R*)l = \( f ( b )) 2 + (/'(*)) 2 | < |/(*)| 2 + \ m \ 2 < 2 . 

Let x be the point in (a, b) where G(x 0 ) is a maximum. Then 

G'(* 0 ) = 2 /'(*o)[/(*o) + /"{*o)] = 0. 

If /'(*o) = 0, then G(x 0 ) = (f(x 0 )f + (f(x 0 )f = (/(x 0 )) 2 < 1. But G(x 0 ) 
>4, since G (0) = 4. Therefore f'(x 0 ) ^ 0, and it must be the case that 
/(*„) + f"(x 0 ) = 0. This completes the proof. 


6.6,6. Let f(x) be differentiable on [0,1] with /(0) = 0 and /(l) = 1. For 
each positive integer n, show that there exist distinct points _x ( , x 2 , . , . , x n 
in [0,1] such that 



= n. 


Solution. To help generate ideas, consider the case n = 1. We wish to find 
jtj in [0,1] such that \/f'(x ] )= \. This is possible by the mean-value 
theorem, since on the interval [0, 1], there is a point jc ( such that/'fxj) = 1. 
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Consider the case n = 2. Consider the subintervals [0,.t] and [x\ 1] where 
a: is some number between 0 and 1 yet to be determined. By the mean- 
value theorem, there is an Xj in (0 ,jc) and x 2 in (x, 1) such that 


f\ x 0 

Thus, 


if and only if 


/(*) -m 

jc-0 


and 


f\ x i) = 


m-m 

i - x 


i 

f'( x o 



= 2 


* , 1 - x 

f{x) 1 - f(x) 


= 2 , 


*0 “/(*)) + 0 ~ X )A X ) = 2 f( x ) - 2 (/(*)) 2 ’ 


x - xf(x)+f(x) - xf(xj-2f(x) + 2(f(x)) z = 0, 


* - 2xf(x) ~f(x) + 2(f(x)) 1 - 0, 
*0 “2 f(x))-f(x)(l - 2/(jc)) = 0, 

[*"/(*)][> ~ 2 /(*)] =0 ‘ 


Now, had we chosen x in (0,1) so that/(x) = 4 (this could be done, by the 
intermediate-value theorem), the proof would be complete upon reversing 
the previous steps. 

With this background we can consider the case for an arbitrary positive 
integer «. Let c i be the smallest number in [0, I] such that/(c,) = ifn (the 
existence of this number is a consequence of the intermediate-value theo¬ 
rem together with the assumption of continuity). Then 0 < c ( < c 2 < ■ • • 
<c n _ ( <I. Define c o = 0 and c n — 1, and for each interval (c ,_,, c,), 
i = 1,2, . . . , n, choose x i such that 

1) 


(this can be done, by the mean-value theorem). Then 


A*/) = 


i i - 1 


n n 



1 

n ( c /- c,-i) ’ 


f'( x d 


2 1 ) = «. 

1 


so that 
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Problems 

6.6.7. 

(a) Show that 

sinx + sin(x + a 1 ) 

=- 

v } COSX — COS(X + d) 

is a constant function by showing that F\x) = 0. (This problem arose 
in 1.2.1.) 

(b) If /’(x) is a polynomial of degee three in x, and y 2 = P(x), show that 

£i/Dy) 

y 2 

is a constant, where D denotes the derivative operator. (Hint: first write 
the above expression in terms of P and its derivatives.) 

6.6.8. 

(a) If y = f(x) is a solution of the differential equation y" + y = 0, show 
that f 1 + (f ) 2 is a constant. 

(b) Use part (a) to show that every solution of y" +y = 0 is of the form 
y = A cosx + B sinx. (Hint: It is easy to show that all functions A cos x 
+ B sinx satisfy the differential equation. Let f(x) be a solution. For 
f(x) to have the form f(x) = A cos x + B sin x it is necessary that A 
= /(0) and B = /'(0). Now consider F(x) = f(x) — f(0)cosx - 
/'(O)sin.x. Apply part (a) to F(x), making use of the fact that 
F( 0) « 0 = F’{ 0). 

(c) Use part (b) to prove the addition formulas 

sin(x + y) = sin x cos y + cos x sin y, 
cos(jc -f y) = cos x cos y — sinx sin y. 


6.6.9. Let f(x) be differentiable on [0,1] with /(0) = 0 and /(l) * 1. For 
each positive integer n and arbitrary given positive numbers k^k^, .. ., k n , 
show that there exist distinct x lt x 2 , . . ., x„ such that 


2 

i ■■ I 


f( x d 



Additional Examples 
6.9.6, 6.9.10, Section 7.4. 
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6.7. L’HopitaFs Rule 


We will assume the reader is familiar with the various forms of L’Hdpital’s 
rule. 

6.7.1. Evaluate 

lim { — -—1 , where a > 0, a¥= 1. 
x a - 1 } 


Solution. Rewrite the expression in the equivalent form 




In this way the problem is transformed to that of evaluating 


lim 

>C© 


log 


I a x ~ 1 i 
x a — 1 


or equivalently, 


lim 


n°gj 


+ | im ton («=1\ 

*-*00\ x } X—*CG \ X ) 


provided each of these limits exists. 

Clearly, lim^^o - l)/x = 0, and by L’Hdpital’s rule, 

log(l/je) 


lim 

jf—*co 


x j 


= lim 

x— >C© 


* 


Also, by L’Hopital’s rule, 

log(a J ~ 1) 


lim ( 

x-*oo y 


It follows that 


1 of-j 


lim ( — ---11 = exp logfl = a. 

-»« V x a — 1 ) 


6.7.2. Suppose that / is a function with two continuous derivatives and 
/(0) = 0. Prove that the function g defined by g(0) = /'(0), g(x) = f(x)/x 
for x ^ 0 has a continuous derivative. 
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Solution. For x =£ 0, 


£'(*) : 


x f(x)-f(x) 


and since /' is continuous, so also is g' for all x ¥= 0. It only remains to 
check that g has a derivative at x = 0, and if g'(0) exists, to see if g' is 
continuous at x = 0. 

For the existence of g'(0) we must examine the following limit: 

j^o\ x — 0 j 


= tim ( 


/(x)/x-/'(0) 


) 


■ i im f jL-lziCWV 


Since /(x) — x/'(0)-»0 as x 0, and since / and /' are differentiable, we 
may apply L’Hopital’s rule to this limit to get 

“ 2 /"( 0 )- 

(The last step follows from the definition of /"(0).) Thus g'(0) exists. 

To check continuity of g' at 0 we have 

( x f( x )-f( x )\ 

hm g'(x) = lim -r—— 

— 0 V * 2 / 

I ft*)* 

■l'”l- T, -) 

The last step follows because we are given that / has a continuous second 
derivative. Thus Hm. [ .^ 0 g'(x) = g'(0), and the proof is complete. 


Problems 

6.73. Evaluate 


$ 



6.8. The Integral 
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6.7.4. Evaluate the following limits: 

(a) lira (l + -) 

v oo \ n ) 

< b > .fe(fri)" 

< c > + 

fd) lim f 1 + — \ 
y f w->oo V n } 

(') “a -£nr ■ whm f. - (' + i )"• and r - - (' + i )""'■ 

6.7.5. Let 0 < a < 6. Evaluate 

6.7.6. Calculate 

lim x fV ! “ A V*. 

Jo 

6.7.7. Prove that the function y = (jc 2 )*, y(0) = 1, is continuous at x = 0. 


-i 1 /' 


6.8. The Integral 


Consider that happens to the sum 

1 + _!_ + ... | 1 

n n + I 2n — 1 

as n->co. One way to think about this is to interpret the sum geometri¬ 
cally: construct rectangles on [«,2«] as shown in Figure 6.12. From the 
figure it is clear that 


- + + • * • + - 1 . > ( 2n -dx = logxl 

n n + 1 2n — 1 x J n 

= login — logrt = log2. 

Similarly, from Figure 6.13 it follows that 

—~ + —j-j + • ■ • + y- < [ 2 ”\dt-\ Q% 2. 

« +1 n + 2 2n t 
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Putting these together, we have 

log2 < — + —+ • • • + T <(- - + log 2. 

n h + 1 2« - 1 U 2 n} 

Now, as n-> oo it is apparent that the sum in question approaches log 2. 
Another way to see this is to rewrite the sum in the form 

and to think of each term, 

()+*/«)«’ 

as the area of the rectangle with base [k/n,(k + 1 )/n] and height 1/(1 + 
k/n). In this way, the sum represents the area in the shaded rectangles 



Figure 6.13. 
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shown in Figure 6.14. As n oo, these areas approach the area bounded by 
y~ 1/(1 + x),y = 0, x = 0, a: = 1. That is, 


n — l 

lim 2 — 


i-*® fc _oV 1 + k/n I n 



1 J^*= 1 °g2. 


6.8.1. Evaluate 


lim 




Solution. The problem asks us to evaluate definite integral 



We will do this geometrically by computing the area under the graph of 
f{x) = [2/x ] - 2 | 1 /jc J between x — 0 and x = 1. The points of discon¬ 
tinuity of f(x) in (0,1) occur at the points where either 2/x or 1 / x is an 
integer. In the first case, 2/x = n when x = 2/n, and in the second case, 
1 /jc = n when x = 1 /n. Thus, we concentrate on the points 1 >2/3 > 2/4 
> 2/5 >2/6 > • • • . 

It is easy to check that for each n , 

if * e ( 2nVr ]’ 
lf X S ( 2« + 2 ’ 21 ‘ 


/(*) = 
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Figure 6.15. 


The graph is as shown in Figure 6.15. The integral is therefore equal to 



Now recall that 

log(l + x) = x- y + ^- ^+ ’- -, - 1 < jc < 1. 

This means that 

2U-i + W+ -- ') = 2 [ log 2 - 1 + *] = log4 - 1, 
and this completes the solution. 

6.8.2, Evaluate 





6.8. The Integral 
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Therefore, we will examine 
r 2 n 


lim 

n —>co 


= lim 

» CO 


= lim 

ft —>00 


= lim 

ft —*CO 


= lim 

n—*<x> 


= lim 

ft —*00 


2 ^iog(n 2 + > 2 ) - i°g» 

l,» io8 " 2 (^)- 108 '' 

|,»{l° g " 1 + lo8 ( l + (») 2 )}-'° 8 " 

!, i io g » ! +1, i logi 1 +(i ) 2 ) - ‘o g « 

^ l08 " ! + ,l, » los ( 1 + (i) ! )- |OB ” 

!.S-(i) 2 )i 

We recognize this final expression as the definite integral 

J* 2 log(l + x 2 )t£>c. 


Using integration by parts, 

J* 2 |og(l + x 2 )<&: = xlog(l + x 2 ) — 2 j —-dx 


= 2 log 5 - 2 V 
Jo 


1 - 


1 + x 2 


dx 


= 21og5 — 2[x — arctanxjo 
= 21og5 — 2[2 — arctan2]. 

Thus, the original limit is 

exp[21og5 - 4 + 2arctan2], 

or equivalently, 

25 exp(2 arctan 2 — 4). 

6.83. Prove that 

= £ 0 (-')*(*) Frbn 
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Solution. The key is to observe that 

7-!-T = f l t k+m dt. 

k + m + I Jo 

Using this, we find that 


= fV(l - t) n dt. 

JO 

Now use a change of variable: let s = 1 — /. Continuing from the last 
integral: 

= J l ' s "(l - 

-I' *!<-»*(:)*** 

k{m\ 1 


- S(-1) 

A — 0 


k ) k + n + 1 


Problems 

6.8.4. Evaluate each of the following: 
( a) lim 

V f n—*c£ 


—1— + —1— + ... + _L 

2«+ 1 2n + 2 3n 


(b) lim 

' * H_s/m 


1° + 2 a + • • • + n a 

„l+a 


a > — 1. 


M .‘JS [,! + „! + 2 ! +n ! + " ' + „ 2 +„’ ]' 

< d > . 1 js,[( , + ;)( 1 + s)---( , + ;)r- 
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6.8.5. Evaluate each of the following: 


(a) lim n 3/2 2 Vfc * 


k - l 


( b ) Hm 2 


it * l 




6.8.6. Find the integral part of 21= i n 2/3 ■ (Hint: Compare the area under 
f(x) = x~ 2/1 over [1, 10 9 + 1] with the area under g(x) = (x — 1) _2/ ' 3 over 
[2, 10 9 + 1].) 

6.8.7. Suppose that/and g are continuous functions on [0,u], and suppose 
that/(x) =/(o — x) and g(x) + g(a — x) = k for all x in [0,a], where k is a 
fixed number. Prove that 

f7(-*)s(*)^= i k \ a R x ) dx - 

JO JO 

Use this fact to evaluate 


r * sin * dx. 

do | + cos 2 x 


6 . 8 . 8 . 

(a) Let 


Compute 


in terms of A. 
(b) Let 



cosx 
(x + 2) 2 


dx. 


L 


sin x cos x 
x + 1 


dx 


f{x)= ^Wi dt for * >o - 

Compute /(*) + /(l / x). 

6.8.9. Find all continuous positive functions /(*), for 0 < x < 1, such that 
/o f(x)dx = 1, Jo xf(x)dx = a, jlyX 2 f(x)dx = a 2 , where a is a given real 
number. 

6.8.10. Let f(x , >’) be a continuous function on the square 

S-{(xj):0<x<l 1 0</<l}. 
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For each point (a, b) in the interior of 5, let S {ab) be the largest square that 
is contained in S , centered at (a,b), and has sides parallel to those of S. If 
the double integral y)dxdy is zero when taken over each square 

S (a b) , must /(*, >') be identically zero on SI 


Additional Examples 

1.4.4, 1.6.3, 1.12.3, 1.12.6, 2.5.15, 6.2.2, 6.2.9, 7.6.3. 

6.9. The Fundamental Theorem 


The fundamental theorem of calculus refers to the inverse relationship that 
holds between differentiation and integration. The fundamental theorem 
for integrals of derivatives states that if F(t) has a continuous derivative on 
an interval [a, 6], then 

f h F'(t)dt= F(b) - F(a ). 

Ja 

In other words, differentiation followed by integration recovers the function 
up to a constant, in the sense that 

F(x) = fV(/)i+ C 

where C = F( 0). 

For example, the derivative of F(t) = sin/ is F’(t) = 2sin/cost. Integra¬ 
tion of F'(t) on [0,x] yields 

sin 2 * = f*2 sin /cost dt. 

In this case we have recovered the function exactly because F(0) = 0. But 
also observe that the integration can be carried out in another manner; 
namely (let u = cos t), 

X 

= — COS 2 * + 1. 

0 

It follows that sin 2 * = — cos 2 .v + 1, or equivalently, sin 2 * + cos 2 * = 1 for 
all *. 


r 

Jq 


2 sin t cos tdt = — cos 2 t 


6.9.1. Find all the differentiable functions/defined for * > 0 which satisfy 
/(*>>) ”/(*)+ /O')- X >F > 0. 
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Solution. When jc = y = 1, we get/(]) = /(I X 1) = /(l) + /(l), and it fol¬ 
lows that /(l) = 0. 

If x #0, we have 0 = /(])= f(x X \/x) = f(x) + f(\/x), and therefore, 
/(!/*) = It follows lhatf(x/y) = f(x) + f(l/y) = f(x)-f(y). 

Now the idea is to look at the derivative of f and then to recover / by 
integration: 


, , ( f( x + h )~f( x )\ 

- h -) 

= l im ( /((^ + k )/ x ) \ 

6—0 ^ h J 

{ fO + 0 \ 

= liml -——— j, where h/x = t 

1 /(I+ 0-/0) 


- limf 

/—y 


) 


Therefore, by the fundamental theorem, 


/(*) = f( x ) -/(’) = f*f'( x ) dxw ‘f* 


n i) 


dx = /'(l)logx. 


Thus, the functions we seek are those of the form f(x) = A log .r, where A is 
an arbitrary constant. 


6.9.2. Find the sum of the series 


J-I + 1-X + 

5 7 11 + 


1 


1 


6n — 5 6 m — 1 


Solution. Consider the function defined by the infinite series 


/(*)=*-t + T-TT + “ 


„6«-5 


^6 n — l 


■ + 


6m — 5 6m — 1 


for 0 < x < 1. The series is absolutely convergent for |.v| < 1, and therefore 
we can rearrange the terms: 

( „7 V L3 v 6n-5 \ 

* + T + lT + •••) 

- ( 2d + + . . . + x<1 ” 1 + . . . ) 

\ 5 11 6/1-I } 
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Our idea is to differentiate /, to change its form, and then to recover / by 
integration by use of the fundamental theorem. We have, for 0 < x < 1, 

/'(*) = (l + .* 6 + • ■- • ) - i x*+x ia + ■■■ + X 6 ”~ I + - ■ •) 

_ 1 JC 4 _ 0 - * Z X ] + * 2 ) 1 +x 2 

l-* 6 l-* 6 (1 - Jt 2 )(l + x 2 + x 4 ) \+x 2 + x 4 ' 


Integrating (the details are not of interest here), and noting that/(0) = 0, we 
get 




arc tan ^ — j + arctan| ^2L±1 j 


Since the series representation of f is convergent for x — 1, Abel’s theorem 
(see Section 5.4) implies that the original series converges to 


/(])= — 

^ } & 


arctan —+ arctani/3 

V3 


2V3 


The fundamental theorem for derivatives of integrals states that if/is a 
continuous function in an interval \a, b\, then for any x in (a.b) 

In other words, integration followed by differentiation recovers the function 
exactly. 


6.93. If u(jc), b(x), c(x), and d(x) are polynomials in jc, show that 
j*a ( x )c (x) dx J*b (d )rf( *) dx — ( x)d( jc) (x )c ( x ) dx 

is divisible by (x — l) 4 . 


Solution. Denote the expression in question by F(x). Notice that F(x) is a 
polynomial in x. Also, notice that / r (l) = 0 and therefore x — 1 is a factor 
of F(x). 

Because F is a polynomial, we know that (x — l) 4 is a root of F(x) = 0 if 
and only if F"'(1) = 0. We can compute F’ by use of the fundamental 
theorem: 

F'(x) = acj*bd+ bdj*ac— ad j* be - be j* ad. 

(Note that /”(!) = 0 and hence that (jc - l) 2 is a root of F{x) = 0.) The 
derivatives F" and F"’ are done in a similar manner; it turns out that 
F'"(l) = (ac)'bd + (bd)'ac — (adybe — {bc)'ad\ x „ x — 0. This completes the 
proof. 
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The next three examples combine several ideas from this chapter. 

6.9.4. Let/: (0, oo)-> R be differentiable, and assume that f(x) + f'(x)—>0 
when jc-» oo. Show that /(.*)-» 0 as x-> oo. 


Solution. First, a digression: If p(x ) and q(x) are continuous functions, the 
equation 


dy 

+ p{ x )y = q{ x ) 

can be solved in the following manner. Multiply each side of the equation 
by m(x) = e^ p<x)lix , and notice that the resulting equation can be put into 
the form 


-^(ym(x)) = m(x)q(x). 

Thus, by the fundamental theorem of calculus, for each constant a, there is 
a constant C such that 


ym(x) = f X m(t)q(t)dt+ C. 

From this, we can solve for y. 

Now, return to our problem and set g(x) = fix) + f{x). According to 
the reasoning of the last paragraph, we can solve for/(x) (in terms of g(A;)) 
by first multiplying each side by e x . As above, this leads to the equation 

f(x)e* = f x e'g {[ )dt+ C, 

or equivalently, 

/(*)= e -*fVg( f )dr+ Ce-*. 

Let e > 0. Since g{x )—>0 as x—>oo, choose o. so that |g(x)| < e for all 
x > a. Then 


l/(*)l < « " (* e ‘g(0dt 

•da 


+ Ce' 


<e- x r e '\g(t)\dt+\Ce- x \ 

Ja 


< se~ x f*e ! dt+ \Ce~ x \ 

= te~ x {e x — e a ) + \Ce~ x \ 

-B(l-e°- x ) + \Ce~*\. 

Now, for sufficiently large x, we will have |/(x)| < 2e. It follows that 
0 as x-> oo. 
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6.9.5. Evaluate 


lim (I/xjJ^l + sin 2() l/ 'r*. 


Solution. Our aim is to apply L’Hopital’s rule, but some preliminary work 
must be done. First, there is a question concerning the existence of the 
integral because the integrand is undefined at ( = 0. However, 


lim (1 + sin2x) l ^ ,c = lim exp — log(l + sin 2x) 


: exp 


lim ( 


log(l + sin2x) 


which by L’Hopital’s rule is 


exp 


lim 


2 cos 2x 
‘6 1 + sin 2x 


= exp 2 = e 2 . 


Thus, if we define 


/ (jc) =f( 1 + sin2j: ) l/ " if 

U 2 if X = 0, 

the function/ is continuous, and /□(] + sin2r ) l/ 'dt = 

In order to apply L'Hopital’s rule to this problem, we must show that 
/o(l 4- sin20 ,/( dt—>0 as x-»0. To do this, let K be an upper bound for 
|/(x)j for all x in (- 1,1). Then, for x in (- 1,1), 

j J*(l + sin 2t) l/ 'di < J"*|l + sm2t\ l/ 'dt < AT|jc|. 

It follows that 



+ sin2/) l/, dt^0 


as x —> 0. 


We are now able to apply L’Hopital’s rule to the original problem: 


f (1 + sin2 t) l/ 'dt 

lim —-= lim (I + sin2x) l/ * = e 2 . 

o x x-* 0 v / 


6.9.6. Suppose that /": [0 T 1 ] —> /? has a continuous second derivative, that 
/(0) = 0 =/(l), and that/(x) > 0 for all x in (0,1). Show that 


r i /*(*) 

Jo /(*) 


dx > 4. 
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Solution. Let X denote a point in (0,1) where f(x) is a maximum, and 
suppose that Y = f(X). Then 


Jl 


A*) 

/(*) 




> 



m-m 

Y 


We appear to be stymied at this point, because it is certainly not necessary 
that f'(Y) ~ f'(0) > 4| Y\. However, by the mean-value theorem, there are 
points a in (0, A") and b in (A", 1) such that 


/(«) 


/(*)-/(«) 

A" — 0 


/(*) 

X 


Y 

X 


and 


Thus, 


m = 


fO)-f(X) 

1 - X 


- Y 
1 - X 


r 

Jo 


/"(*) 

dx> f b 

f"(x) 

/(*) 

Ja 

m 


d,t> m 


so applying the fundamental theorem to the last integral, we have 


/' 


f" (*) 
f( x ) 


^ > T>i 


\y\ 

= j_ 

I Y\ 


x 

X 


- Y 
1 - A 

-J- + JL 

\ — X X 


xv-X) 

But the maximum value of x(l - x) in (0,1) is ^ (when x = j) and 


therefore 


r 

Jo 


A*) 

/(*) 


dx > 


1 


|A*(1 — A*)| 


> 4. 


Problems 


6.9.7. What function is defined by the equation 
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6.9.8. Let/:[0,1 ] —>(0,1) be continuous. Show that the equation 

lx - f x f(t)dt= 1 

Jo 

has one and only one solution in the interval [0,1], 

6.9.9. Suppose that / is a continuous function for all x which satisfies the 
equation 

where C is a constant. Find an explicit form for f(x) and find the value of 
the constant C. 

6.9. ID. Let C] and C 2 be curves passing through the origin as shown in 
Figure 6.16. A curve C is said to bisect in area the region between Cj and 
C 2 if for each point P of C the two shaded regions A and B shown in the 
figure have equal areas. Determine the upper curve C 2 given that the 
bisecting curve C has the equation y = x 2 and the lower curve C ( has the 
equation y = \ xl 

6.9.11. Sum the series l+j- 5 - 7 + 9 +Tr--j 3 - * * • ■ 

6.9.12. Suppose that / is differentiable, and that /'(*) is strictly increasing 
for x > 0. If /(0) = 0, prove that f(x)/x is strictly increasing for x > 0. 


Additional Examples 


1.5.1, 5.1.3, 5.1.9, 5.1.11, 5.4.6, 7.6.5. 



Chapter 7. Inequalities 


Inequalities are useful in virtually all areas of mathematics, and inequality 
problems are among the most beautiful. Among all the possible inequalities 
that we might consider, we shall concentrate on just two: the arithmetic- 
mean-geometric-mean inequality in Section 7,2 and the Cauchy-Schwarz 
inequality in Section 7.3. In addition, we shall consider various algebraic 
and geometric techniques in Section 7.1, and analytic techniques in Sec¬ 
tions 7.4 and 7.5. In the final section. Section 7.6, we shall see how 
inequalities can be used to evaluate limits. 


7.1. Basic Inequality Properties 


The most immediate approach for establishing an inequality is to appeal to 
an algebraic manipulation or a geometric interpretation. For example, the 
arithmetic-mean-geometric-mean inequality, 

>Jab , 0 < a < b, 

can be established algebraically by writing it in the equivalent form 

(Vo -V6 ) J > 0, 

or geometrically by considering the semicircle in Figure 7.1. (The semicircle 
is constructed with diameter AB of length a + b, and C is a point chosen so 
that AC = a and CB = b. A perpendicular to AB from C meets the circle at 
D. Triangles ACD and CDB are similar, and therefore a/CD= CD/b. 
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Figure 7.1. 


Il follows that CD =4ab ■ Clearly, iab < radius of circle = (a + b)/2.) 
Both derivations make it clear that equality holds if and only if a = b. 

In this section we will consider examples of inequalities that can be 
verified by using only algebraic and geometric ideas. 

7.1.1. Show that for positive numbers a, b, c, 

a 1 + b 2 + c 1 > ab + be + ca. 


Solution. Working backwards, 

a 1 + b 2 + c 2 > ab + be + ca, 

2a 2 + 2 b 2 + 2c 2 > 2 ab + 2£>c + lea, 

(a 1 — lab + b 2 ) + ( b 1 - 2be + c 1 ) + (c 2 — 2 ca + a 2 ) > 0, 

(a - b) 2 + (b - c) 2 + (e- a f> 0. 

This last inequality is obviously true, and since the steps are reversible, the 
solution is complete. (The proof also makes it clear that equality holds if 
and only if a = b = c.) 

This example illustrates a common theme: manipulate the expression 
into a form to take advantage of the fact that a squared number is 
non negative. 


7.1.2. Prove that for 0 < x: < \ it, cos 2 * + * sin * < 2. 


Solution. Consider the function 

f(x) = 2 — cos 2 * — * sin *, 
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and perform the following manipulations: 

/(*) = 1 + (1 - cos 2 *) - *$inx 

= I + sin 2 * — xsinx 
= (1 — 2sin* + sin 2 *) — *sin* + 2sin* 

= (1 - sinx) 2 + (2 — *)sin*. 

In this form we see that the desired inequality holds for 0 < * < 2. 


7,13. If 0 < a,b t c < 1, show that 


7 —^~rr +--—r + —r-T—r + (1 - a)( 1 - b)( 1 - c) < 1. 

b + c + 1 c + a + I a + b + l ' 


Solution. Here, straightforward algebraic expansion leads to horrendous 
and unenlightening complications. One simplification is to assume, without 
loss of generality, that 0 < a < b < c < 1. Then, for example, we have 

a _I _ b _ | _ c ^ a + b + c 

h + c+1 c + a + 1 a + b + \ a + b + 1 ’ 

and we might try to prove that 

^±|ie +(1 - a)( i-6 )(1 - c) < i. 

This problem is easier algebraically, but still messy, and of course, we may 
have given away too much (that is to say, this inequality may not even be 
true). However, we have the following: 


a + b + c 
a + b + 1 


+ (l-«)(I-h)(l- C ) 


_ a + b + 1 + c - 1 
a+h+1 a + b +I 


+ (l-a)(l-&)( I 


c ) 


The desired inequality follows from this expression after noting that 
(1 + a + b)( 1 — a)( 1 - b) < (1 + a + b + ab)( 1 — er)( 1 — b) 

= (1 +a)(l + 6)(1 - n)(l - b) 


= (l-« 2 )(l-h 2 ) 

< 1. 
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7.1.4. Let n be a positive integer, and a, > 1, for i = 1,2, .... n. Show that 
(1 + * 0(1 + a 2 ) ■ • ■ (1 + a n ) > + a, + ■ - ■ + «„)- 


Solution. Induction is a natural strategy here, and it is not difficult to carry 
out in this manner. But the following “give a little” argument is more fun: 


(l+B,)(l+0,)---(l+fl fl ) 

-1 




. a ■ 1 a*, — I a„ — 1 

> 2"11 + Ac— + + • ■ ■ + 


2"|l 


2 2 

a , — 1 «,— l 


2 

/, n, — 1 — 1 o„ - 1 \ 

> 2" 1 + —— r + —— r + ■ • • + - 
\ n + 1 n + 1 n + l } 


) 


2” 


1 +(J| - 1+^1 + ... + a„-\) 


2” 


= ;r+T (1+ ‘ Il + a2 + +«»)• 


7.1.5. For each positive integer n, prove that 


Solution, This is an important inequality that can be proved in a number of 
ways (see 7.1.11, 7.2,8, 7.4.18). Here we will give a proof based on 
comparing corresponding terms in the binomial expansions of each side. 
On die left side, 






it — o 


n ■ n ■ n ■ ■ ■ n 


A:! 
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In a similar manner, 


-(7hr 

The inequality is now obvious, since comparing the coefficients of I fk\ in 
these expressions, we see that for each k, k = 0,1,2, . . ., n. 

It is worth noting that 


Jt = 0 


k — I 


* — 1 2 *-o 2 

00 i 

< 1 +Si=3. 

o 2* 

Thus, the sequence (1 + 1 fnf is increasing and bounded above by 3. (It 
can be shown that the sequence converges to the number e,) 

The next result is important theoretically and is very useful (e.g., see 
7.4.9 and 7.4.20). 

7.1.6. Suppose that f:R->R satisfies 

f (z±y) < m±m 

for all x and_y is an interval (a,b), x ^=y. Show that 

^ *i + *z+ ~ • ' + x„ j ^ ~ • +/K) 

whenever the x/s are in (a,b), with x i ^ Xj for at least one pair 


n- I 


1 
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Solution. Assume the result holds for n = m\ we will show it holds also for 
n 2m. We have 


A 


*1 + • ‘ • + 
2m 




)) 


-4i(- 

1 ( f{ x l) + ‘ ‘ ■ + /(*«) ^ f( x m+\) + ‘ ' +/(*!-) 


X m + L + • ‘ • + X 2m 


< 


2 ( 


m 


= /( X >) + f( X i) + • • • + f( X 2m) 

2m 

Thus, by induction, the result holds for all positive powers of 2. 

Now suppose that « > 2 and n is not a power of 2; that is, suppose 
that 2 m_l < « < 2 m for some integer m. Let k = 2"' - n, and set y, 
= {*1 + • • ‘ + x„)/n for /= 1,2, .... A. Then x lt x 2 , . . x„, y t , ■ ■ ■ ,y k 
are 2 m numbers in the interval (a,b), and therefore our preceding argument 
implies that 


A 


*i + •••+*„ +Ti + •■ - +y*\ , f( x i) + • • ■ +/(t*) 


< 


But note that 

*i + ‘ ' ‘ + x » + Tl + ’ ’ ‘ + /* 




2 m ) 
x ( + ■ • ■ +x H +k{ Xl + ■ ■ ■ +x„)/n 


«(*! + • ■ • + *„) + (2 m - «)(*! + • ■ • + x „) 


) 




n X 2” 


) 


Making this substitution into the last inequality, 


'( 


+ ■ ■ ■ + x, \ /(*,) + • • • +/(*„) +/Q , i)+ ■ ■ ■ +/M 


) 


< 


f( x ()+■*’+ f(x„) + */((*, + • • • + x„)/n) 
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Multiplying each side by 2™ yields 
x. 4- • • • + x 


2*/(- ' ^ * )</(».)+■■■ +/(*.) 

and from this we get the desired inequality for n: 


A 


L + ‘ ■ ‘ + \ ^ f( X l) + ' ‘ ’ + /(*") 


; ) 


< 


Problems 

7.1.7. Suppose that a, b, c are positive numbers. Prove that: 

(a) (a + b)(b + c)(c + a) > %abc. 

(b) a 2 b 2 + b 2 c 2 + c 2 a 2 > abc{a + 6 + c). 

(c) If a + b + c = I, then ab + be + ca < j . 

7.1.8. Prove that 

135 999999 r 1 

2 4 6' 1000000 ^ 1000 ' 

(Hint: Square each side and “give a little” to create a “telescoping” product 
(see Section 5.3).) 

7.1.9. 

(a) If a and b are nonzero real'numbers, prove that at least one of the 
following inequalities holds: 


a + tJci 2 + 2 b 2 

1 

a-^a 2 + 2 b 2 

2b 


2b 


(b) If the n numbers x t ,x 2 , ■ . . ,x n lie in the interval (0,1), prove that at 
least one of the following inequalities holds: 

* 1*2 ■ • • < 2“\ (I - x,)(l - * 2 ) • • • (| - x„) < 2~\ 


7.1.10. 

(a) Let a ( / bi,a 2 /b 2 , .... a n /b n be n fractions with f>, > 0 for i — 1, 
2Show that the fraction 

a t + a 2 + ■ ■ • + a n 
bi + b 2 + - - * + b„ 

is a number between the largest and the smallest of these fractions. 
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(Note the special case in which all the fractions ajbj are equal.) 

(b) If 

a + b _ c + d 
b + c d + a 

prove that either « = cor« + h + c + </ = 0. 

7.1.11. 

(a) For 0 < a < b, show that 

(n + 1 )(fr- a)a n < b n+ ' - a»+ l <(n + 1 ){b-a)b". 

(b) Apply this inequality to the special case a = 1 + 1 j(n + 1) and b = 1 + 
1/n to show that (l + l/«) n < (1 + 1 /(« + l)) n + 1 . 

7.1.12. Prove that for all n, 

7.1.13 (Cauchy-Schwarz inequality). By mathematical induction on n, 
prove that for all real numbers a h . . ., a n ,b h . . . , b„, 

7.1.14. In a convex quadrilateral (the two diagonals are interior to the 
quadrilateral) prove that the sum lengths of the diagonals is less than the 
perimeter but greater than one-half the perimeter. 

7.1.15. Prove that for any positive integer «, " i/m < 1 + VVn . 


Additional Examples 

1.3.3, 1.7.4, 1.7.5, 1.8.2, 1.8.5, 1.8.6, 1.12.7, 2.1.5, 2.1.6, 2.2.4, 2.2.6, 2.4.1, 

2.4.4, 2.4.6, 5.3.8, 6.1.3, 7.3.1, 7.4.8, 7.419, 7.4.20, 7.4.21, 7.4.22, 7.4.23. 

7.2. Arithmetic-Mean-Geometric-Mean Inequality 

Let x i > 0 for i = 1,2, . . ., n. The arithmetic mean of x ls x 2 , . .., x n is the 
number 

X[ + x 2 + - ■ ■ + x„ 


n 
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and the geometric mean of x ( , ,x* 2 ,. . ., is the number 


(* 1*2 


11 /" 


The arithmetic-mean-geometric-mean inequality states that 


. , + x 2 + 

(*r*2 ' ' ‘ x „) y < -= 


+ x„ 


with equality if and only if all the x ; ’s are equal. 

The special case n = 2 was verified both algebraically and geometrically 
in the beginning paragraphs of Section 7.1. A proof for larger values of n 
can be handled by mathematical induction (e.g., see 7.2.5 or 2.5.7), or by 
considering the concavity of the function /(/) = log / (see 7.4.20). However, 
a more enlightening heuristic (however, not a proof) can be made as 
follows. 

Consider the geometric mean (X|X 2 • • ■ and the arithmetic mean 
(jf! + * • ■ + x„)/n. If not all the x/s are equal, replace the largest and the 
smallest of them, say x M and x m respectively, by \(x M + x m ). Then, since 
j( x m + + H x m + *«) = x m + x m , and [ 4 (* w + x w )] 2 > x M x m , the re¬ 

sult of this replacement is that the geometric mean has increased while the 
arithmetic mean has remained unchanged. If the new set of n numbers are 
not all equal, we can repeat the process as before. By repeating this process 
sufficiently often, we can make the quantities as nearly equal as we please 
(this step needs additional justification, but we won’t worry about it here). 
At each stage of the process, the geometric mean is increased and the 
arithmetic mean is unchanged. If it should happen that all the numbers 
become equal (this may never happen, however; e.g., take x, = 1, x 2 = 3, 
x 3 = 4), the two means will coincide. It must be the case, therefore, that the 
geometric mean is less than or equal to the arithmetic mean, with equality 
when and only when all the numbers are equal. 

As an example of this process, consider the case x t = 2,x 2 = 4, x 3 = 8, 
x 4 = 12. The algorithm described yields the following sequences of sets: 

{2,4,8,12} ->{7,4,8,7} ->{7,6,6,7}^ ,4? 


The geometric means of the corresponding sets increase to ; the arithme¬ 
tic means remain fixed at 


7.2.1. Prove that the cube is the rectangular parallelepiped with maximum 
volume for a given surface area, and of minimum surface area for a given 
volume. 


Solution. Let the lengths of the three adjacent sides be a , b, and c. Let A 
and V denote the surface area and volume respectively of the parallelepi¬ 
ped. Then 

A = 2{ab + be + ca ) and V = abc. 
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By the arithmetic-mean-geometric-mean inequality, 

V 1 = a 2 b 2 c 1 — ( ab)(bc)(ca ) 

^ a b + bc+ca ^_ ^(ab + bc+ca) j 3 _^3 
Thus, for all a,b,c, 

6 V 2 ^<A. 

Furthermore, 6 V 2 ^ <A in all cases except for when ab — be = ca (or 
equivalently, when a = b = c), and in this case 6 F 2 ^ 3 = A. Thus, if A is 
fixed, we get the greatest volume (namely V — {A / 6 ) 3/2 ) when a = b = c (a 
cube), and when F is fixed, we get the least surface area (namely A 
= 6 F 2 / 3 ) when a = b = c (a cube). 


7.2.2. Prove the following inequality: 
h[(h + l) 1/fl _1 ]< 1 + ^ + 5 + '' 


+ i < n - (ti - 1 )« l/( " 


Solution. Let = 1 + 5 + • * • +1 /«. The leftmost inequality is equivalent 
to proving 

n + 


>(«+ \) l/n . 


which has vaguely the look of an arithmetic-mean-geometric-mean in¬ 
equality. We can make the idea work in the following way: 

n + s n n + (1 + 1/2 + • * - + 1 /m) 
m n 

_ (1 + 1 ) + (1 + 1 / 2 ) + • ■ • + (1 + 1 /n) 

M 

2 + 3/2 + 4/3 +‘ • • * + (m + 1)/m 


> 


( 2 3 4 M+ 1 V 7 " 

V 2 3 n I 


t/« 


= (« + l) 

For the rightmost inequality, we need to show that 


n s 


n- 1 


- > 
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Again, using the arithmetic-mean-geometric-mean inequality, we have 

n — s n « — (1 + 1/2 + 1/3 + • • - + I/rt) 
n — 1 n — 1 


(l-l) + (l-l/2)+---+(l-l/«) 
n - I 

1/2+ 2/3+ • ■ • +(n- 1 )/n 
n- 1 


/ 1 2 3 n - M 

{ 2 ‘ 3 ' 4 ’ ' ‘ n i 


-(i) 


1 t !/("->) 


= jj-l/fn-l). 


7^2.3. If o,h,c are positive numbers such that (1 + a)(l + f>Xl + c) = 8, 
prove that abc < 1. 


Solution. We are given that 

1 + (a + b + c) + (ah + he + ca) + abc = 8. 

By the arithmetic-mean-geometric-mean inequality, 

a + b + c < 3(ahc) l ' /3 and ab + be + ca '< 3 (abc) 2 ^ 2 , 
each with equality if and only if a = b = c. Thus, 

8 > 1 + 3(tfhc) l/3 + 3 (abc) 2 ^ + abc 

= [l + (nh C )'^] 3 . 

It follows that 


(ahc) I/3 <( 2-1) = 1, 

or equivalently, 


abc < I 

with equality if and only if a = b = c = ]. 


7.2.4. Suppose that x s > 0, / = 1,2,. . ., n and let x n+ ( = ,v,. Show that 
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Solution. Consider the case n = 3. By the arithmetic-mean-geometric-mean 
inequality, we have 



Also, 


F'r'?4(rM(rH(F) ! 


Adding these inequalities gives the desired result. The case for an arbitrary 
positive integer n is similar. 


Problems 

7.2.5. Fill in the steps of the following inductive proof of the arithmetic- 
mean-geometric-mean inequality: For each k, let A k = (x, + x 2 + • • • 
+ x k }/k, and G k = (x,x 2 • • • x*) l/ *. Assume that we have shown A k > G k . 
Let 


x k + i + (k- l)A k 
k 


and G = (x k A£ + ?) l/l< . 


Then, using the inductive assumption, we have A > G, and it follows that 
A k + l - \(A k + A)> {A k A)^ 2 > (G k G) l/1 = (G^A^)'^ . From this 
it follows that A k + X > G k + \- On the basis of this argument it is easy to 
prove the equality holds if and only if all the x y ’s are equal. 


7.2.6. If a,b,c are positive numbers, prove that 


( a 2 b + b 2 c + c 2 aj(a 2 c + b 2 a + c 2 b ) > 9 a 2 b 2 c 2 . 


7.2.7. Suppose that a t , . . ., a„ are positive numbers and b lt ... ,b„ is a 
rearrangement of aa n . Show that 



7.2,8. 

(a) For positive numbers a and b, b, prove that 

/ a hi\ i /i n+ a + nb 
K } n + 1 ’ 
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(b) In part (a), consider the case a = 1 and b = 1 + l/« and show that 

(c) In part (a), replace n by n + 1, let a = 1 and b = rt/(n + 1), and show 
that 




i y +2 

n+l) 


7.2.9. For each integer rt > 2, prove that 

<a» n UHf-fp 

(b) 

(c) 1 X 3 X 5 X * • • X (2n - 1) < n". 


7.2.10. Given that all roots of x 6 — 6x 5 + ax 4 + bx 3, + cx 1 + dx + 1=0 
are positive, find a,b,c,d. 

7.2.11. 


(a) Let x t > 0 for i = 1,2, . . ., «, and let p y , p 2 , ■ ■ ■, p„ he positive inte¬ 
gers. Prove that 


{xP'xp ■ ■ ■ x^) l/ip,+ " +pJ < PlX ' + 

P\ + 


• • +p* x » 
■ ■ +p„ 


(b) Prove the same result as in part (a) holds even when the p ; ’s are positive 
rational numbers. 


7.2.12. Use the arithmetic-mean-geometric-mean inequality for each of 

the following: 

(a) A tank with a rectangular base and rectangular sides is to be open at 
the top. It is to be constructed so that its width is 4 meters and its 
volume is 36 cubic meters. If building the tank costs $10 per square 
meter for the base and $5 per square meter for the sides, what is the 
cost of the least expensive tank? 

(b) A farmer with a field adjacent to a straight river wishes to fence a 
rectangular region for grazing. If no fence is needed along the river, and 
he has 1000 feet of fencing, what should be the dimensions of the field 
so that it has a maximum area? (Hint: It is equivalent to maximize 
twice the area.) 

(c) A farmer with 1000 feet of fencing wishes to construct a rectangular 
pen and to divide it into two smaller rectangular plots by adding a 
common fence down the middle. What should the overall dimensions 
of the pen be in order to maximize the total area? 

(d) Prove that the square is the rectangle of maximum area for a given 
perimeter, and of minimum perimeter for a given area. 
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(e) Prove that the equilateral triangle is the triangle of maximum area for a 
given perimeter, and of minimum perimeter for a given area. (Hint: 
The area of a triangle is related to the perimeter of the triangle by the 
formula A = (s(s - a)(s - b){s - c)) l/2 , where a,b,c are the lengths of 
the sides of the triangle and s = \ P, P the perimeter of the triangle.) 


Additional Examples 

Introduction to Section 7.6; 7.3.1, 8.1.4. 


7.3. Cauchy-Schwarz Inequality 


Let a, > 0 and 6, > 0 for i = 1,2, . . ., n. The Cauchy-Schwarz inequality 
states that 


ft 


2 a, A < 



with equality if and only if = a 2 /bi — * • • ~ u n /b„. 

A proof can be given using mathematical induction (see 7.1.13). But 
an easier approach is to consider the quadratic polynomial P(x) = 
i(d;X — fe,) 2 - Observe that P(x) > 0 for all x; in fact, P(x) = 0 only 
under the conditions in which a ] /b l = a 1 /b 2 = ■■■ =a„/b l! and x 
= bj/aj. Now 

P(x) = 2 {afx 1 - 2afrx + bf) 

i - i 

=(I,4 2 - 2 (Ia K?a 


and since P(x) > 0, the discriminant of P cannot be positive, and in fact 
will equal zero only when P(x) ~ 0, Thus, 


or equivalently, 


n i n \ { n v 1 / 2 

2*A<(2«i 2 ) (2*. 2 ) , 


with equality if and only if a x j 6, = • - • = a„/ b„. 
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In this inequality, note that the requirement that the a, and b t be positive 
is redundant, since for all a,, b, 


n 


n 


2>A<2NAI< 

/=l i-| 



73.1. If a r b,c > 0, is it true that a cos 2 # + b sin 2 # < c implies 4a cos 2 # 
+ 4b sin?# <4c ? 


Solution. By the Cauchy-Schwarz inequality 
(a cos 2 # -fJTsin 2 # 

<^(|cfcos#) 2 + (^sin#) 2 ] l ^(cos#) 2 + (sin#) 2 j l/2 
= (u cos?# + b sin?#) 1/2 

< 44 . 

There is also a nice solution based on the arithmetic-mean-geometric- 
mean inequality: 

2 

(Vo cos 2 # + 4b sin 2 # ) = a cos 4 # + 2 4a t/# cos 2 # sin 2 # + b sin 4 # 

< a cos 4 # + (a + #)cos?# sin 2 # + b sin 4 # 

= (a cos?# + b sin?#)(cos?# + sin?#) 

< c. 

Another solution, more geometric in nature, is given in 7.4.19. 


73.2. Let P be a point in the interior of triangle ABC, and let r u r 2 ,r 3 
denote the distances from P to the sides a t ,a 2 ,a$ of the triangle respec¬ 
tively. Let R denote the circumradius of ABC. Show that 




with equality if and only if ABC is equilateral and P is the incenter. 


Solution. By the Cauchy-Schwarz inequality 


+ 4r4 + 444 = yl/«7 + 4a4.r 2 yj\/a 2 + vW 3 \/T ~[a 

< (a,r t + a 2 r 2 + a 3 r 3 )'/ 2 ^ A- + ± + ±-'j 


3 

1/2 
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with equality if and only if 

77. = 77 = 7*71 

777 777 777 


or equivalently, if and only if 

7 r i = 7 r i = 7r 3 ■ 


In the preceding inequality, we recognize that a l r l + a 2 r 2 + a 3 r 3 = 2 A, 
where A is the area of the triangle. Also, we know that the area of a 
triangle, in terms of the circumradius R, is given by A = a x a 2 a 3 /AR (see 
8.1.12). Therefore, a l r l + a 2 r 2 + a 3 r 3 = a,a 2 a 3 /2R, and we have 


_ , a^a, y/V VrtW, \ 
\ 2R ) \ a x a 2 a 3 / 


1/2 


~ -= (<* 2 a 3 + a 3 a l + « l a 2 ) 1/2 . 

&R V ' 

Now, again by the Cauchy-Schwarz inequality, 

+ fl 3 «i + <*i«2 < (7 + 7 + 7) 1/2 (7 + 7 + 7) 1/2 

= (a 2 + a 2 + a]) 

with equality if and only of a 2 /a 3 = a 3 /a l = a l /a 2 {— (a 2 + a 3 4- a x )/ 
(a 3 + a, + a 2 ) = 1; see 7.1.12), or equivalently, if and only if 


— £?2 — Oj. 

Thus, we have 

7 + 7 + 7 < -==■ (7 + 7 + 7)' /2 

7r 

with equality if and only if afr t = a\r 2 — a\r 3 and a, — a 2 = a 3 ; that is, if 
and only if a, = a 2 = a 3 and r l = r 2 = r 3 . This completes the proof. 


733. Given that a,b,c,d,e are real numbers such that 


a+b+c+d+e= 8, 
a 2 + b 1 + c 1 + d 2 + e 2 = 16, 
determine the maximum value of e. 


Solution. The given equations can be put into the form 

8 — e = a + b+ c+d, 

1 6-e 2 = a 2 + b 2 + e 2 + d 2 . 
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We wish to find an inequality involing only e; the Cauchy-Schwarz 
inequality provides a way, since 

(a + b+ c + d) <(1 + 1 + 1 + l) l/ 2 (a 2 + b 1 + c 2 + d 1 )' /2 . 

Making the substitutions given'above, and squaring, we have 

(8 - ef < 4(16- e 2 ), 

64 — I6e + e 2 < 64 — 4e 2 , 

5e 2 ~ 16e < 0, 
e(5e — 16) < 0. 

It follows that 0 < e < The upper bound, is attained when a = b = c 
= d = f. 

73.4, Suppose that a lt a 2f . . . , a„ are real (n > 1) and 

Prove that A < 2a l a / for 1 < i <j < n. 

Solution. By the Cauchy-Schwarz inequality 

n 

2 fl ;J -[(<»i + fl 2 ) + a 3 + • • • +a„f 

<(!+**■+ 1)((«l + o 2 ) 2 + Oj + • • • + a 2 ) 

■ n 

= («-!) 2 a f + 2a i a 2 

L I 

This, together with the given inequality, implies that 

< -( 2 ^)+ ("“O itaf + loiai 

= 2 a l a 1 . 

In a similar manner, A < 2a for 1 < i <j < n. 

73.5. Let jc ; > 0 for i = 1,2 , ,n. For each nonnegative integer k, prove 

that 

** + ■ • • + X* Jtf + l + • • • + jc* + i 
n Jc, + * * * 4 - x„ 
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Solution. We may assume without loss of generality that x ( + • • • + x a 
= 1, for if not, we can replace x t by X, = x i /(x l + • • • + x„). 

The result holds when k = 0. Assume the result holds for all nonnegative 
integers less than k. By the Cauchy-Schwarz inequality, 

n n „(* —t)/J 

^ -L _ ^ + ti _ 




1/2 


By the inductive assumption, Sy-i** ’/» < Si-i*!** an< ^ therefore, con¬ 
tinuing from the last inequality, we have 


/ « \ 1 / z / n / « \ '/ 2 / « 

(,?,>) ,?,T ' 


1/2/ R \’/2 


Thus, 


n ic / n \l/2/ a X .*V I/Z 

/ a * \ l/2 / n \ l /2 


n v Jt rt 

i=i n t-i 

By induction, the proof is complete. 


Problems 

73.6. Use the Cauchy-Schwarz inequality to prove that if a,,, a n are 
real numbers such that a ( + • • ■ + a„ = 1, then a 2 + ■ ■ • a z > 1 fn. 

73.7. Use the Cauchy-Schwarz inequality to prove the following: 

(a) If p\, - - • ,p„,x h . . ., jc„ are 2 m positive numbers, 

(Pi x i + ' ' ‘ +P***) i <(Pi+ • • • +P*){prf + ‘ ■ ' +P n *l)- 

(b) If a,b,c are positive numbers, 

( a 2 b + b z c + c 2 a)(ab 2 + be 1 + ca 2 ) > 9 a 2 b 2 c 2 . 

(c) If ** , y k , k = 1,2,, . ., m, are positive numbers, 

n / n \ 1/2/ n \ 1/2 

2 **^*<1 2 K 2 ) 2 **/*) • 

Jt« I \fr-l / \Jt-l / 
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(d) If a k , b k , c k , k = 1,2, .. ., n, are positive numbers, 

(e) If C k — (*) for n > 2, 1 < k < n, 

k = I 


73.8. For n a positive integer, let (a l ,a 2 , .. ., a n ) and (b h b 2 , .. . , 6 n ) be 
two (not necessarily distinct) permutations of (1,2, . . ., n). Find sharp 
lower and upper bounds for a l b l + • • • + a n b R . 


73.9. If a,b,c,d are positive numbers such 
prove that 



that c 2 + d 2 = (a 2 + b 2 )\ 


with equality if and only if ad = be. (Hint: Show that (a 3 /c + b 3 jd\ac + 
bd) > (a 2 + b 2 ) 2 > ac + bd.) 


73.10. Let P be a point in the interior of triangle ABC, and let r i ,r 2 ,e i 
denote the distances from P to the sides a ]> a 2 ,a 2 of the triangle respec¬ 
tively. Use the Cauchy-Schwarz inequality to show that the minimum 
value of 


a. a. 
— + — 

G r 2 


+ — 




occurs when P is at the incenter of triangle /liJC. (Hint a t = 4a j r i -ja-Jr t .) 


Additional Example 
7.6.14. 

7.4. Functional Considerations 


In this section we will give examples to show how the techniques of 
analysis, particularly differentiation, can be used effectively on a wide 
variety of inequality problems. 


7.4.1. Given positive numbers p, q, and r, such that Ip = q + r, q ¥= r, 
show that 
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Solution. Suppose that q and r are positive integers, and consider the q 
numbers 1 f q, . . ., 1 / q and the r numbers 1 / r, . . . , 1 / r. By the arithme¬ 
tic-mean-geometric-mean inequality, 

/ , j ^ ?(!/?)+ r(]/r) ^ 

\qi ' r'} < q+r p' 

which is equivalent to the desired inequality. 

Of course, this method breaks down if either q or r is not an integer, so 
how shall we proceed? One idea is to rewrite the inequality in the following 
manner: 


p<> + r < q 9 r r . 



1 / q y /< ‘i + r >j r y/(<?+o 

2 < \ q + r) \ q+r) 

Set x = q/(q + r ) and y = rj(q + r). Observe that x + y = 1 and 
0 < x, y < 1 . Then the problem is equivalent to proving that 

0 < * < 1 , x=pi. 

By introducing the function in this way, we are able to use the methods 
of analysis. The idea is to find the minimum value of F on (0,1). To 
simplify the differentiation, we will consider the function G(x) = log/^x). 
To find the critical points, we differentiate: 

G '(*) = ^ + (1 “ *)l°g(l ~ *)] 


= (log* + 1) - 1 - log(l - JC) 


= log 


X 

]-x‘ 


We see that <?'(■*) = 0 if and only if jc = |. Furthermore, G’{x) < 0 on the 
interval (0,t), and G\x) > 0 on the interval 1). Therefore G{x) takes its 
minimum value on (0,1) at x = Thus, the minimum value of F(x) on 
(0,1) is F(^) = = I - It follows that F(x) >t for all x in (0,1), 

x and the proof is complete. 


7.4,2. Let p and q be positive numbers with p + q = 1. Show that for all x, 
pe x/ P + qe~ x t* < 
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Solution. Consider the function 


F( X ) = 


pe 


'tp + qe~ x h 


2 /»pV 


Our problem is to prove that F(x) < 1 for all x. Because of the symmetry in 
the problem, it suffices to prove that F(x) < ] for all x > 0. 

We note that F(0) = 1. By Corollary (in) of the mean-value theorem (see 
the discussion preceding 6.6.2), it suffices to prove that F'(x) < 0 for all x. 
To simplify the computation, consider the function G(x) = logFfjc). Rou¬ 
tine differentiation and algebraic simplification yields 

= F '^ = e * /p ~ e ~ x/q - x 
K } F ( x ) pe*/? + qe-*/i 4p 2 q 2 

= e x/ ™ - 1 _ x 
pe x/pq + q 4p 2 q 2 

Since F(x) > 0 for all x > 0, F'(x) < 0 if and only if G'(x) < 0. Unfortu¬ 
nately, the preceding expression for G'(x) makes it difficult to determine 
whether or not G'(x) < 0. Therefore, we will carry the analysis through 
another step. Namely, G'(0) = 0, and (again leaving out the details) 


G"(x) 


(. Pt *~ 

4p 2 q 2 (pe x /w + qf 


Here is is clear that G"(x) < 0 for all x > 0. This, together with G'(0) = 0, 
implies that G'(x) < 0 for all x > 0, and this in turp implies F'(x) < 0 for 
all x > 0. Therefore, since F( 0) = 1, it must be the case that F(x) < l for all 
x > 0, and the proof is complete. 


The procedure used in the preceding problem is very common. To 
recapitulate, it goes like this: To prove an inequality of the form 

f(x) > £(*), x> a, 

it is equivalent to prove either that 


or that 


e<*) = 


/(*) 

g( x ) 




x > a. 


D ( x ) = /(*) “ > 0. x> a. 


Each can be done by showing the inequality for x = a and then by showing 
that Q'(x) > 0 (or 2>'(x) > 0 respectively) for all x > a. 

In the previous example, if we had considered instead the function 

D(X) m _ px/p _ ye-*/*' 


this analysis wouldn’t have been conclusive: even though Z)(0) — 0, it is not 
necessarily the case that D'(x) > 0 (for example, when/> — ^,^ = j,x = j). 
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7.43. Prove that for all real numbers a and b, 

\a + b\ p < \a\ p + \b\ p , 0 < p < 1. 

Solution. The inequality is trivial in several special cases. For example, the 
result holds if a = 0, or if a and b have opposite signs. Also, if p = 0 or 
p = 1, the result is true. Therefore, it suffices to show the result is true when 
a and b are positive and 0<P< 1. 

For such a and b and p, let x = b/a. Then, the problem is to show that 

(1 + x) p < 1 + x p , x>0, 0 < p < 1. 

For this, let 2>(x) = 1 + x p — (1 + xf. We have Z)(0) = 0 and D'(x ) 
= px r~ l - p(\ + x y-' > 0, so by our earlier remarks, the proof is com¬ 
plete. (Note that if p > 1, the inequalities would be reversed.) 

7.4.4. On [0,1], let / have a continuous derivative satisfying 0 < / (t) < 1 • 
Also, suppose that /(0) = 0. Prove that 

X'AO^jC'lAOlV 

Solution. Here, as in the last example, it is not clear how to make use of 
differentiation. The idea is to introduce a variable and prove a more 
general result. For 0 < x < 1, let 

Then F(0) = 0, and 

F'(*) = 2 

= /(*)[ 2 j [*/<')<*"[/(*)] 2 • 

We do know that f(x) > 0 for 0 < x < 1 (since we are given /(0) = 0 and 
f{x) > 0); however, it is not clear that the second factor in the last 
expression for F’ is nonnegative. Therefore, let 

G ( x ) = 1 P7(0 a -[/(*)] J - 0 < x < I. 

•JQ 

Then G(0) = 0, and 

G\x) = 2 f{x) - 2 f(x)f\x) 

= 2/(*)[l—/'<*)] >0 

(the last inequality holds because f(x) > 0 and, by hypothesis, 1 — f(x) 
> 0 ). 
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It follows from these arguments that F(x) > 0 for all x, 0 < x < 1; in 
particular, F(l) > 0 and the proof is complete. 

7.4.5. Show that if x is positive, then log(l + 1/jt) > 1/(1 + x). 


Solution. Let f(x ) = log(l + 1 / x) — 1/(1 + x) (= log(l + x) - log* - 
1/(1 + x)). Then 


/'(*) = 


- -i + 


(i + *r 


+ *)~Q +*f + x 
x(l + X ) 2 


= ——!— <0 for JC > 0. 
*(1 + xf 


Furthermore, lim^^^/fjc) = 0, and this, together with f'(x) < 0 for x > 0, 
implies that f(x) > 0 for x > 0. 


7.4.6. Find all positive integers « such that 

3" + 4" + - * * +(n + 2)"=(n + 3) fl . 


Solution. A direct calculation shows that we get equality when n = 2 and 
when n = 3. A parity argument shows that it can’t hold when either n = 4 
or n = 5. Based on this beginning, we might expect that the key insight 
should involve modular arithmetic in some way. However, these attempts 
aren’t fruitful, and we look for another approach. We will show that 

3 fl + 4" + • • • +(n + 2) fl <(« + 3) 3 

for n > 6, and thus equality holds only when n = 2 or n = 3. 

The inequality we wish to prove can be written in the following form: . 



or, reversing the order for convenience, 



To prove this inequality it suffices to show that 

(* “IT+t) < (l) ’ k=\,2,...,n. 
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For then, 


i\ 1 


1 V+-- 

[ « + 3 

) + v « 

+ 3 / + 

<l + i 


■+(!)"< 

2 

1 2 ) 

V 2 J 

It remains, then, to prove that 



*-'' 2 .*■ 

By Bernoulli’s inequality (a very useful inequality; see 7.4.10), 

and therefore, 

(-^r<c-^r-[c-^)T 

The final step is to show that 

(1 — —^ < I- when n > 6. 

I n + 3 } 2 

For this, consider the function 

'w-O-d*)- 

It is straightforward to show that F’(x) < 0 for x > 6, and that F(6 ) < {. 
Thus, the proof is complete. 


7.4.7. Prove that for 0 < a < b <}ir, 
b ~ a 


cos 2 n 


< tan b — tan a < 


b - a 
cos 2 b 


Solution. Consider the function /(x) = tan;c on [a,b\. According to the 
mean-value theorem there is a point c in (a,b) such that 


f(b)-f(a) 

b-a 


= f( c Y 


In this case, this means that 


tan b — tan a = ^ 
b — a 


for some c in (a, b). The desired inequality follows from the fact that 
sec 2 <2 < sec 2 c < sec 2 6 for 0 < a < b < ir/2. 


Many inequalities can be established by considering an appropriate 
convex (or concave) function. The idea is based on the result of 6.6.3: if 
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/: R -*■ R is such that f'{x) > 0, then 

a x + y \ s- f ( x ) + -At) 

—r~* 

and if /"(*) < 0, then 

For example, for real numbers a and b, 

because f(x ) = x 2 is a convex function. As another example, 
0 < x, y < it. 


x + y \ sinx + sin y 


■ y\ 

sm ( 2 ) 


2 ;" 2 

because/(x) = sinx is a concave function on (0, •n). 


if 


7.4.8. Prove that if a and b are positive numbers such that a + b = 1, then 




2 25 


Solution. We have seen that 


Take x = a + 1 fa andy = 6+1 / b. Then 


1 

2 








2 



But by the Cauchy-Schwarz inequality (1 /a + 1 /b\a + b) > (1 + l) 2 = 4, 
so that 



4 

a + b 



25 
4 ‘ 


The result follows after putting together the two preceding inequalities and 
multiplying each side by 2. 


7A9. Let 0 < x y < it, i = 1, .. ., it, and set x = (x ( + x 2 + • • • + x n )/n. 
Prove that 
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Solution. The problem is equivalent to proving that 


sin x. 


Consider the function 


2 log-' < «log 


/(t) = log^ 


sin x 


It is a straightforward matter to show that / is concave (/"(*) < 0) on the 
interval (0, tt). Therefore 

■ / Jt, + JC 2 \ /(*l)+/(*2) 




2 ) 2 

In a manner completely analogous to the proof of 7.1.6, it follows that 


A 


*1 -+-•••+ \ ^ f( X l) + ■ ‘ • + /(■*„) 


n f n 

Direct substitition into this inequality completes the proof: 




, sm*. 
log-+ 


+ log 


sinx^X 
*« / 


Problems 


7.4.10 (Bernoulli’s inequality). Prove that for 0 < a < 1, 

(1 + 1 + ax, x > — 1. 

How should the inequality go when a < 0, or when a > 1? 
7.4.11. Prove that 

* . . ^*(*+ 2 ) 


< 108(1 + *) <J^. *>°- 


7.4.12 (Huygens’s inequality). Prove that 

2sinx + tanx > 3x, 0 < x < n/2. 

7.4.13. For all x > 0, (2 + cosx)x > 3sinx. 

(a) Prove this inequality by considering the function F(x) = x — (3 sin x)/ 
(2 + cos x). 

(b) Prove this inequality by considering the function F(x) = (2 + cosx)x — 
3sinx. 

7.4.14. Prove that 

r, „ * lo S* ^ 1 
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7.4.15. Prove that 

S'-JT 

7.4.16. Prove that 

( a + 1 
\b + 1 


3 ) + (x + 2,u +3) ' ° 

r>(fr, «»o, 


j: > — 2. 


aj=b. 


7.4.17. Prove that 


sing a tan a 

sin b b tan b ’ 


0 < b < a <{ir. 


7.4.18, Use the methods of this section to prove that for each positive 
integer n, 

(That is, show that f(x) = (l + \/x) x is an increasing function.) 


7.4.19. Use the concavity of f(x) = ftc to prove that if a , b, c are positive, 
then a cos 2 # + b sin 2 # < c implies ft> cos 2 # + ft sin 2 # <i Ic . (Hint: Sketch the 
graph of f(.x) = ft . In the domain, where is the point geos 2 # + b sin 2 #, and 
in the range, where is ft cos 2 # + ft sin 2 #?) 


7.4.20. Let x t ■ > 0 for / = 1,2, . . ., n. Consider the function /(f) = log/, 
and in a manner similar to that used in 7.4.9, prove-that 




x. + x 2 + 


+ 


with equality if an only if all the jq are equal. 


7.4.21. 


(a) Let x t > 0 for / = 1,2, . . ., n. Use the result of 7.4.20 to show that 


rt 


JC, 



<(*1*2 ■••*») 


l/n 


(b) For positive numbers a,b,c such that 1/g + l/b + \/c = I, show that 
(a - IX* - l)(c- 1) > 8. 


7.4.22. Show that if a, b, c are positive numbers with a + b + c = I, then 




100 

3 


7.4.23. Let aft, c denote the lengths of the sides of a triangle. Show that 


3 ^ a , b , c 
2 b + c c + a a + b 


< 2 . 
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Additional Examples 
6.4.6, 6.4.7. 

7.5. Inequalities by Series 


Another way to prove an inequality of the form 

f(x) < g(x), 0 < x < c 

(see the discussion preceding 7.4.3) is to expand / and g in power series, say 
f(x) = Sn-tA*’ 1 and £(*) = f °r x in the interval {-d,d). If it 

should happen that a„ < b n for ail n, then it is obvious that f(x) < g(x) for 
all x in the interval (0 ,d). 


7.5.1. For which real numbers c is \{e x + e *) < e 1 ’* 2 for all real xl 


Solution. If the inequality holds for all x then 

0 < e" ! — e x + e~ x ) 


-2 


C X 


In 


00 JC 2 " 


n — 0 


ni 


\ i A 

A 1 ”, 


« = 0 


2"n\ 



1 \ jc 2n 
2" ) n! 

1 \ 

2 " ) «! 


To see that c >{, divide each side by x 2 and set x = 0. 
On the other hand, if c > ^, 


“ v 2 " 

l( e + (1^7 


" * 2 " 


< y 

n -0 2 -«! 

= g j3 / 2 


< e 


ex 1 


It follows that the stated inequality holds for all x if and only if c > 


1 

2 ’ 
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Another important series technique with application to inequality prob¬ 
lems concerns alternating series. Recall that if a Q ,ai,a 2 , ... is a sequence 
of positive numbers, then the series 0" fl n converges provided the 

terms steadily decrease to zero (i.e., a n+ , < a n and a„ -»0 as n-* oo). More 
importantly for our purposes here, the sum of the series lies between any 
two successive partial sums. (If S denotes the sum of the series and S„ 
denotes the nth partial sum, then { S ln+ ,} is an increasing sequence, { S 2 „ 1 
is a decreasing sequence, and for all n , S ln+ , < S < S 2 „.) 

7.5.2. Show that for all x, 

1 + x + IT + 


„2n 


(2 »)! 


> 0 . 


Solution. Certainly the claim is true when x is positive or zero. When x is 
negative the series on the left is an alternating series, and because 2 m is 
even (so the last term in the finite sum on the left is positive), the reasoning 
preceding the problem implies that 


‘ + j, + !i + ■ 



>1 + jt+ !r + - 


r 2 " 

j -r.-- 4. 

(2»)1 


e* > 0. 


7.5J. Prove that (2 + cos x)x > 3 sin x, * > 0. 


Solution. This is the same problem as 7.4.13, but here we will give a 
solution based on series considerations. 

On the left side of the desired inequality, we know that for x > 0, 

(2 + cos*)* > (2 + 1 - £ + £ _ |! 

and on the right side 


3sin* < 3^* “ Jp + J- 

Therefore, it is sufficient to prove that 

3*— — + — — — > 3 /* — — + 
2 ! 4! 6 ! \ 3! 
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This is true for x > 0 if and only if 

__ x^_ ^ 3* 5 

4! 6! 5! ’ 

(±-±\>± x * 

\ 4! 5! } 6! ’ 

* i< 6 ! (j,H 2 - 

This proves the desired inequality for the case in which 0 < * </f2 . 
But the inequality is obvious for * >{l2, and therefore, it is true for all 
* > 0, and the proof is complete. 

In the preceding proof, one might ask why these many terms from the 
infinite series were chosen. Why not more or less? To keep the inequalities 
going in the right direction, we need to underestimate cos* and overesti¬ 
mate sin*, thus dictating the signs of the final terms in the series approxi¬ 
mations, The crudest estimate would be to replace cos* by 1 — x 2 /2 and to 
replace sin* by *. This leads us to investigate 

which is equivalent to 

- ~ > 0 
2 > 

and this is not true for any positive value of *. 

As the number of terms in the series increases, the approximations 
improve, so the next try might be to replace cos* by 1 — * 2 /2 -I- x 4 /4! — 
x 6 /6! and sin* by * - x J /3l + x 5 /5!. This leads to the solution as it was 
presented. 


7.5.4. Prove that 




0 < * < 


Solution. For * > 0, 


and 


* 2 . * 4 
2 12 


Therefore, it suffices to show that 


i ** . *' * 

* ^ T 


£ 

12 


6 

216 


* 

216 ’ 


cos* <1 2 + 4! 6! + 8! 


> 1 + 

2 J * *ri a • 


4! 6! 


£ 

8t 




7.6. The Squeeze Principle 


271 


or equivalently, 


— + (—— -i—L ^2 _ — x 4 > o 

4! \ 216 720 f 8! 


The left side is decreasing on the interval (0,^w], and therefore takes its 
minimum when x = ^ ?r. In particular, for 0 < x < 2, 



'--L + -LW2 
. 216 720 ) 


This completes the proof. 


fT-' 4> ¥ + (-ii6)< 2 > 2 -iri< 2 > < 

. x _ A . J_ = 30-24-1 
4! 5! 6! 61 


> 0 . 


Problems 

7.5.5. Use infinite series to prove the following inequalities: 

(a) e x > 1 + (I + x)log(l + x), x > 0. 

(b) (1 4- x)/{\ -x)> e 2 \ 0 < x < 1. 

(c) arcsinx < x/(l - x 2 ), 0 < x < 1. 

7.5.6. Prove that Vi + x — 1 — tx + ^x 2 <^-x 3 , x > 0. 

7.5.7. Prove that 

x<i (2sinx 4- tanx), x > 0. 

[Hint: Show the equivalent inequality, 

sinx(2cosx + 1) > 3xcosx, x > 0.] 

7.5.8. Show that sin 2 x < sinx 2 for 0 < x 


7.6. The Squeeze Principle 


In this section we will see how inequality considerations can play an 
important role in evaluating limits. The key idea (which has many varia¬ 
tions) is expressed in the following result. 

The Squeeze Principle. If {a„}, { b „}, {c„} are infinite sequences such that a„ < b„ 
< c„ for all sufficiently large n, and if { a n } and ( c n } converge to the same 
number L , then {£>„} also converges to L. 

As innocuous as this principle appears (obviously, there is no alternative 
for d is “squeezed” between {a„} and {e„}, both of which are 

converging to the same limit), it is surprising that it can be useful in 
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problem solving. Nevertheless, it is applicable in the following situation. 
Suppose we wish to evaluate the limit of a sequence {£„}, and suppose the 
b n *s are hopelessly complicated, so that they cannot be handled directly. 
The squeeze principle suggests that we try to “squeeze” {b n } with two 
simpler sequences { a n } and {c„}. 

For example, consider the sequence {n ]/n }. We could evaluate this limit 
by L’Hopital’s rule; however, consider the following argument. By the 
arithmetic-mean-geometric-meaninequality, 



Now, by the squeeze principle (with a„ = 1, and c n = 1 4- 2(1 /i/m — 1 /«)) 
we see that n 1 is forced to converge to 1. 

7.6.1. Prove or disprove that the set of all positive rational numbers can be 
arranged in an infinite sequence {£„} such that {(b n ) l/n } is convergent. 

Solution. We begin by ordering the rational numbers by following the usual 
serpentine path through the square array of rationals shown in Figure 7.2, 
where we omit all fractions not reduced to lowest terms. The sequence thus 
begins l,j,2,j,4,5,^,^, . . . . If b„ denotes the nth term of this 
sequence, we would like to prove that {by n } converges to 1. 

In Figure 7.2, observe that every element in the nth row is less than or 
equal to n, and every element in the nth column is greater than or equal to 

sn /i ^ 

1 1/2 1/3 1/4 1/5 



2 2/2 2/3 2/4 2/5 



4 4/2 4/3 4/4 4/5 



Figure 7.2. 
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\/n. Also, if b n occurs in row i and column j, then i < n and j < n. 
Therefore 


— < — < b„ < i < n 

« J 


for all«, 


and consequently. 



Now, by the squeeze principle, {i>y n } converges to 1. 


7.6.2. Let f(x) be a real-valued function, defined for - 1 < x < 1, such 
that/'(0) exists. Let {«„}, {b„} be two sequences such that 

- 1 < a„ < 0 < b < 1, lim (a„) = 0 = lim ( b n ). 

n ft-* CO N ' ft-*<x> v 

Prove that 


lim 

/j— 


f(K)-/(<**) 


-Ab¬ 


solution. The quotient 

f{ b n) -/(«■) 

K ~ 

can be interpreted geometrically as the slope of the line segment 
/<«„)). QJKJihJ) (see Figure 7.3.) 

Let R be the point (0, /(0)). Either the y-intercept of the line segment 
P„Q„ is less than or equal to/(0) (case 1), or it is greater than/(0) (case 2). 


(Case 1) 



Figure 7.3. 
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In the first case, 

Slope RQ„ < Slope P„ Q n < Slope 

or equivalently, 

f(K) -m /(».)-/(«■) M) ~/(0) 

K - 0 - o 

In the second case, 

Slope P„R < Slope P„Q„ < Slope /?£>,,, 

or equivalently, 

/(M-/K) < m-m 

a n ~ 0 “ «„ “ 0 

In case 2, the inequalities are just reversed from those in case 1. To 
correct this, we define two new sequences which reverse the roles of a n and 
b n in case 2. Thus, let {c„} and {d„} be defined by 

c n = b n and d n = a n if case 1 holds for a n and b n , 

c„ = a„ and d„ = b if case 2 holds for a „ and b„ . 

rl ft ft ft n ri 

Then, for all n, 

/(^)-/k) M)-/(°) 

C„ - 0 b„ - a„ d„ - 0 


Since/'(0) exists, and since lim n ^ w c n = 0 = lim^^d,,, 


lim ^ Cn) ^° } =/'( 0 ) and 

n-KX> C — Q J v ' 


„ m /KW(0) 

n->t» d„ ~ 0 ' 


The result now follows from the squeeze principle. 

Another instructive solution, also based on the squeeze principle, is 
based on the fact that if a and b are real numbers, a < b, then 

a < ra + sb < b 

for all positive numbers r and 5 that add to 1 (see 1.2.11). In this problem, 
write 


/(M ~ /K) 

K ~ a„ 




)fe) 


and set r =» b„/(b n — a „) and .1 = — a„l(b„ — a H ). Then r > 0, s > 0, and 
r + * - I - Therefore [f(b B )-f(a„)]/[b n - a„] lies between [f(b„) - /(0)]/6„ 
and - /(0)]/a„. Since these latter quotients converge to /'(0), so also 

must lf(b„) — f{a„)]/[b„ — a„] by the squeeze principle. 
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Figure 7.4. 


7.63. Evaluate 
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so, by the squeeze principle, 

kn j / n 2 

lim 5] --—— < lim Y , H , < lim arctann 2 , 

”- f0 ° /Ti 1 4- (j/nf yTi « 2 + / n ^“ 


jC 


* dx 


< lim 2 <>• 


1 + x 2 n ^“jT| n 2 + / 


But, since & was an arbitrary positive integer, we must have 

n 2 

isr =■ lim arctan&< lim Y ” , <iff. 

*-*■0° n -'” jZ I rt 2 + / 

It follows that the desired limit equals jsr. 

Another important application of inequalities to the evaluation of limits 
is based on the following important fact. 

Monotonic , bounded sequences converge. 

That is to say, if { a„} is a sequence of real numbers such that a n+l > a n for 
all sufficiently large n (or a n + l < a n for all sufficiently large a), and if for 
some constant K, a n < K for all n (or a n > K respectively), then the 
sequence {a„} converges. 

For example, to prove that the sequence (1 + 1/n)" converges, it is 
sufficient to prove it is monotonic (increasing in this case) and bounded 
above (by 3; see 7.1.5). 


7.6.4. If {«„} is a sequence such that for n > 1 

(2 - <J n )a„ + | = 1, 
prove that lim^^a,, exists and is equal to 1. 


Solution. First we will prove that if the sequence converges, it must 
converge to 1. The argument is standard when a sequence is defined recur¬ 
sively as it is here. Let lim^^a = L. Then, taking the limit of each side of 
the recurrence relation (2 ~ a n )a n + i = l, we see that (2 — L)L = 1, or 
equivalently, (L — l) 2 = 0, from which we conclude that L = 1. 

Now, to prove that the sequence converges, we will prove that it is 
bounded, and “eventually” becomes monotonic. (For another solution of 
this problem, see 1.1.11.) 

Suppose that for some a„ , 0 < a n < 1. Then 

1 ~ (2 - a„)a„ 


3n+i a * a " 


2 -a. 


(1 - <*»? 


2 -a. 


>0 
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and tf n+1 = 1/(2 - a n ) < 1. Therefore, a n < a„ +1 < a„ +2 < ‘ ‘ ‘ < 1, so the 
sequence is monotonic and bounded and therefore converges. Thus, it 
suffices to prove that for some n f 0<a„<l. There are several cases. 

If < 0, then 0 <a 2 < 1/(2 - a,) < 1, so we’re done by the preceding 
argument. 

If a, > 2, then a 2 = 1 /(2 — a{) < 0, so again we’re done. 

If a, = 1, then a n = 1 for all n. 

It remains to check the case 1 < «, < 2. Some playing around with 

special cases in this interval leads to the following (each of which can be 

proved by induction). 

First, the recursion cannot hold for all n if a Y has the form (n + 1 )/n. 
For if a l ~(n + l)/«, then (one can show that) a„ = 2 and consequently 
a n+ 1 cannot be defined. Secondly, if a t belongs to the interval 

fl+i _n-^) 

V n n — 1 / 

then (one can show that) a n + ] lies in the interval (0,1) and the proof is 
complete by previous reasoning. 

Thus, in all cases (for which the sequence is defined) the sequence 
converges. 


7*6.5. Let f(x) be a function such that/(l) = 1 and for x > 1 


fV) 


1 

x 2 +f\x) ‘ 


Prove that lim^^/f.v) exists and is less than 1 + i ff. 


Solution. By the fundamental theorem of calculus 


Observe that f(x) is increasing; moreover, f(x) >1 for all x > 1, since 
/(l) = 1 and f'(x) > 0. Therefore 




dx 

* 2 + / 2 (*) 



dx 

1 + x 2 


= arc tan x]^ 

= arctanx — arctan 1 

<iff- — iff = iff. 

Thus, f(x) is increasing and bounded above by 1 + iff, and consequently, 
lim^^/fx) exists and is less than 1 + |ff. 


7,6.6. Consider all the natural numbers which represented in the decimal 
system have no 9 among their digits. Prove that the series formed by the 
reciprocals of these numbers converges. 



278 


7. Inequalities 


Solution. Let S m denote the mth partial sum of the series under consider¬ 
ation. The sequence {S m } is monotone increasing, so to prove convergence, 
we need only prove that the sequence is bounded. 

For a given partial sum S m , let n denote the number of digits in the 
integer m. The number of integers of exactly n digits which have no 9 in 
their decimal representation is 8 X 9" ” 1 (the first digit cannot be zero). 
Therefore, the sum of their reciprocals is less than 8 X 9 n_l /10 rt_, . Thus 

S m <8 + 8x^ + 8x(^) 2 + - + 8x(^) n -‘ 

< 8 [‘ + tfc + (£) 2+ ---] = 8 o, 

and the proof is complete. 


Problems 


7.6.7. Prove the inequalities which follow and apply the squeeze principle 
to evaluate a limit: 


(a) -p?= < £ - 7 =!= < • 

V n 2 + n y n 1 + i yn 2 + 1 

(b) a < (a n + b n )^ n < a"^2 ,0 < a < b. 

(c) e'~ 1/(2n > < (1 + 1 /«)" < 1 /< 2 ")i/O" 1 ), 

7.6.8. Prove that each of the following sequences converges, and find its 
limit: 


(a) vT,Vi+^,yiTVi+7r V i+ V> +V> +7r . 

(b) , 



7.6.9. Prove that the sequence {a„} defined by 

«»“l + 5+ -‘-+^ - to 8« 

converges. 

7.6.10. Prove that the sequence {a n } defined by 

6(1 + «„) 


converges, and find its limit. 
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7.6.11. Let a, and b x be any two positive numbers, and define {a„} and 

\K) 

„ _ u _ r. z 

a " ~Z V a «- ]b„-1 • 

a n- 1 ■ f ‘ °n- I 

Prove that the sequences { a n } and {b n } converge and have the same limit. 

7.6.12. S t = logo, and S„ = 2/- ' log (<2 — S ; ), n > 1. Show that 

lim S = a — 1. 

(Hint: Note that *S n + 1 = S„ + log(a + £„).) 

7.6.13. The sequence Q n (x ) of polynomials is defined by 

0 i (jc) = 1 + jc, Q 2 (x)= l + 2x, 

and for m > 1, 

Qim+ii*) = CM*) + (« + I )*£Mi(*). 

Q 2 m + 2 ( X ) = Q 2 m + l( x ) + (m+ 1 )xQ 2m (x). 

Let be the largest real solution of Q„(x) = 0. Prove that { x„} is an 
increasing sequence and that lim^^x,, = 0. 

7.6.14. Prove that if converges, so does 2if -\( a */ n )- 

7.6.15. Prove that 

2 2 X 4 2 X 6 2 X ■ • * X (2n) 2 
A™ (1 x' 3)(3 X 5) - ■ - ((2« - 1)(2« + 1)) 


2 2 X 4 2 X 6 2 X ■ • * X (2«) 


,2 x 32 x 5 2 X * • * X (2/i - 1/ V + 1 / l 

(Hint: For O<0<i7r, j^ 2 sin 2n + l 0de < f% /2 sin 2n $d0 < /^sin 2 " -1 ^. 
Apply the result of 2.5.14 together with the squeeze principle.) 


y * n ) { 1 1 

„ _ n 2 l 2n + 1 / 2 ’ r ‘ 


Additional Examples 

6.1.5, 6.3.7, 6.4.4, 6.6.2, Section 6 . 8 , 6.94. Also, see examples of “repeated 
bisection” in Section 6.1. 



Chapter 8. Geometry 


In this chapter we will look at some of the most common techniques for 
solving problems in Euclidean geometry. In addition to the classical syn¬ 
thetic methods of Euclid, we will see how algebra, trigonometry, analysis, 
vector algebra, and complex numbers can be useful tools in the study of 
geometry. 


8.1. Classical Plane Geometry 


In this section we will review the ideas and methods characteristic of 
classical plane geometry: namely, the study of those properties of triangles, 
quadrilaterals, and circles that remain invariant under motion (e.g., transla¬ 
tion, rotation, reflection). We will be concerned with synthetic geometry, 
which builds on an understanding of the basic notations of congruency, 
similarity, proportion, concurrency, arcs and chords of circles, inscribed 
angles, etc. In addition, we wish to draw attention to the importance of 
algebraic and trigonometric techniques for proving results in traditional 
Euclidean plane geometry. 


8.1.1. Find the area of a convex octagon that is inscribed in a circle and 
has four consecutive sides of length 3 units and the remaining four sides of 
length 2 units. Give the answer in the form r + , with r, s, and t positive 

integers. 
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We will give several solutions to this problem to illustrate the variety of 
methods that are at ones disposal in this subject. 


Solution 1. Let the vertices be labeled ABCDEFGH as shown in Figure 8.1 
where AB - BC = GH = HA = 3 and CD = DE = EF = FG = 2. Let O 
denote the center of the circle. 

We first find the area of A OA B and A ODE. For this, it suffices to find 
the altitudes OK and OJ. 

Notice that OK ~ \ EB\ this follows because O is the midpoint of EA 
and K is the midpoint of AB. Similarly, OJ = \AD, and therefore, it 
suffices to find DI, I A, El, and IB, where / is the intersection of AD and 
EB. 

By angle-side-angle, &DBC —‘ &DBI, and therefore DI = 2 and 
IB = 3. Furthermore, since a ADE and a ABE are each inscribed in a 
semicircle, LADE and LABE are right angles. Therefore, A IB A and 
A EDI are isosceles right triangles, and it follows that IA — 3V2 and 
El = 2,f2. 

We can now find the area of the octagon: 


Area 



- 13 + 12# . 


Solution 2. Perhaps the easiest solution is based on recognizing that the 
area of the octagon is the same as either of those shown in Figure 8.2, 
having alternating sides of lengths 2 and 3. The area can be computed by 
subtracting four triangular regions from a square, or by adding the areas of 
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Figure 8.2. 

a square, four rectangles, and four triangles. Thus, for the diagram on the 
left, we have (x = \ /2) 

octagon area = (2x + l) 2 — 4(^x 2 ) 

= 2x 2 + 8x + 4 

= 2(JJ2) 2 +Sx^ + 4 

= 13 + 13/2 . 

Or, for working from the inside on the figure on the right (y = /2 ), 
octagon area = 9 + 4(3y) + 4(4 >' 2 ) 

= 9 + 12/2 + 2x2 
= 13 + 12/2 . 

Solution 3. Let R denote the radius of the circle. The area of the octagon is 
equal to four times the area in quadrilateral OA BC (see Figure 8.3). Clearly 

Area OABC = Area A CM C + Area A ABC, 


A 



0 R c 

Figure 8.3. 
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R 2 = $ + 3/2 . 

The final result then follows after substituting this value for R 2 into the 
preceding equation for Area OA BC. 


Solution 4. In Figure 8.4, D and E are the feet of perpendiculars drawn 
from B to OA and OC respectively. Let x = OE and j-’ = OD, and let R be 
the radius of the circle. Then 

area of octagon = 4(area of quadrilateral OA BC ) 

= 4(Area a OAB + Area a OCB ) 

= 4 [\Rx + \Ry] 

= 2 R(x +y). 


A 



0 EC 

Figure 8.4. 
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Our plan is to express x + y in terms of R, and then use the fact that 
R 2 = + 3/2 (see the last solution). 

The Pythagorean theorem applied to A ABD yields jc 2 + (/? — _v) 2 = 4, 
or equivalently, 2K(/( — 7 ) = 4 (note: x 2 + y 2 = R 2 ). Similarly, from 
A EBC we have y 2 = 9 - (/l - *) 2 , or equivalently, 2R(R - .r) = 9. Add¬ 
ing R -y = 4/(2 R) and R - x = 9/(2R) yields 2R - (x + j>) = 13/(2 R), 
or equivalently x + y = (4 R 2 — 13)/(2/t). Substituting, we find 

j = 4R 2 - 13 

= 4(^ + 3/2 j — 13= 13+ 12/2 . 


area of octagon = 2 R 


AR 2 — 13 


Solution 5. The octagaon can be cut like a pie into eight triangular pieces 
with equal sides of length R (equal to the radius of the circumscribed circle) 
and with bases 3 and 2. Let H and h denote the altitudes of these triangles 
as shown in Figure 8.5. Then 

area of octagon = 4(^3 ■ H) + 4(^2 • A) 

= 6* H + 4* h. 

With a and ft as shown in Figure 8.5, we have the following relation¬ 
ships: a + ft = tt/ 4; sino = 3/(XR)\ cos <tx=H/R\ sin j8 = 1 /R; cos/3 
= hjR. From these, we find 

R = _1 _ _ 1 

sin ft sin(t^r — a) 

= _ ! _ 

j/2 coso — t/2 sina 

= jf(ll/R-3/2R) 

It follows that 

1 /2 (2U — 3) 




Figure 8.5. 
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or equivalently, 

Using this, 

h = R cos fi = /([cos^tt — a)J = j/2 cosa 4- j /2 sinaj 

= ^V2 [2(| + V2 ) -H 3] 

= l+§/2 . 

Substituting, 

area of octagon = + /2 ) + 4(l + |/2 ) - 13 + 13/2 . 

8.1.2. If A and B are fixed points on a given circle and AT is a variable 
diameter of the same circle, determine the locus of the points of intersection 
of lines AX and BY. (You may assume that AB is not a diameter.) 

Solution. Consider Figure 8.6, where A and B are fixed points on the 
circumference of a given circle. Let B’ be the point on the circle diametri¬ 
cally opposite of B. Let P and P' denote respectively the intersection of AX 
and BY when this intersection lies inside or outside the circle (depending 
upon which side of the line BB ' the point X falls; see figure). 

In the first case, Z APB = 90° + I(Arc AB), and this is a constant value 
for all diameters which result in an “inside” intersection point P. This 
implies that P lies on the circle formed by- those points making a constant 
angle (namely, 90° + t(Arc/45)) with the constant base AB. 

In the second case, LAP'B = 90° — \{AicAB), and this is a constant 
value for all diameters which result in an “outside” intersection point P’. 



Figure 8.6. 
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Figure 8.7. 

Thus, P' lies on a circle which passes through A and B. Furthermore, 
LAPS and LAP'B are supplementary angles (Z APB + YAP'B = 
90° + ^{McAB) + 90° - £(Arc/4fl) = 180°), and therefore APB?' is a 
cyclic quadrilateral; that is to say, P and P' lie on the same circle through 
A and B. 


8.1 3. P is an interior point of the angle whose sides are the rays OA and 
OB. Locate X on OA and Y on OB so that the line segment X Y contains P 
and so that the product of distances ( PX)(PY ) is a minimum. 


Solution. This problem was solved in 6.4.2 by using methods of analysis. 
Here we will solve it geometrically. 

Let OC be the line bisecting Y^AOB, and let L denote the line through P 
which is perpendicular to OC. Let X and Y denote the intersections of L 
with OA and OB respectively (see Figure 8.7), 

Now, OX = OF, so there is a circle tangent to OA at X and OB at Y. 
Let X t F[ be any other segment containing P with on OA and Fj on OB. 
Let X 2 and F 2 be the intersections of X } F, with the circle. Then (PX)(PY) 
= (PX 2 )(PY 2 ) < (FA' I )(/ > F I ), so (PX)iPY) is the minimum. 

8.1.4. Let P be an interior point of triangle ABC, and let a, y,z denote the 
distances from P to BC, AC, and AB respectively. Where should P be 
located to maximize the product xyzt 


Solution. Let a,b,c denote the lengths of the sides BC, AC, and AB 
respectively (Figure 8.8). By the arithmetic-mean-geometric-mean inequal¬ 
ity, 
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C 



Figure 8.8. 

But we know that ax + by + cz = 2A, where A is the area of the triangle. 
Thus, the maximum value of xvz is &A 3 /(21abc), and this occurs if and 
only if ax = by = cz. 

We will show that ax = by = cz if and only if P is located at the centroid 
of /\ABC. For this, suppose that CP intersects AB at D. Let a, ft, y,S be 
the angles as shown in Figure 8.9. It is known that 

b sin ft = AP 
a sin a DB 

(This relationship is useful in many problems. To see that it is true, apply 
the law of sines to A ADC and to {\CDB to get 

= and 

sin ft siny sin a smo 

Using these equations it follows that 

£>sin ft AD siny _ AP 

nsinn Z>5sin5 DB ’ 

since y and S are obviously supplementary.) 

Using the above equation, we have 

AD _ ^sin ft = by/(CP) ~ by 
DB a sin a ax/(CP) ax 

and it follows that AD ~ DB if and only if by = ax. Thus, ax = by if and 
only if P is on the median line from C. 


C 
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In a similar manner, ax = cz if and only if P is on the median line from 
B . It follows that ax — by = cz if and only if P is the centroid of A ABC. 


Problems 


8 . 1 . 5 . Show that a triangle must be equilateral if any pair of the following 
centers coincide: incenter, circumcenter, centroid, orthocenter. 

8 . 1 . 6 . An acute triangle is inscribed in a circle. The resulting three minor 
arcs of the circle are reflected about the corresponding sides of the triangle 
(i.e., arc AB is reflected about side AB, etc.; see Figure 8.10). Are the 
reflected arcs concurrent? 

8 . 1 . 7 . Let Cj and C 2 be circles of radius 1, tangent to each other and to the 
x-axis, with the center of C Y on the y-axis. Now construct a sequence of 
circles C„ such that C„ +l is tanget to C„_ h C„, and the x-axis. 

(a) Find the radius r n of C„. 

(b) Show that the length of the common tangent included between its 
contacts with two consecutive circles C„_, and C„ is (?) ~ 1 . 

(c) From part (b) and the geometry of the problem, show that 



8 . 1 . 8 . If a,b,c are the sides of a triangle ABC, , t c are the angle 
bisectors, and T a ,T b ,T c are the angle bisectors extended until they are 
chords of the circle circumscribing the triangle ABC, prove that 

abc = ^T a T b T c t a t b t c . 

(Hint: Prove that T a t a = be, etc.) 
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8 . 1 . 9 . 

(a) Given a point P inside an arbitrary angle XOY, let AB be a segment 
through P such that AP = PB, and let MN be any other line through P 
which intersects OX and OY at M and N respectively. Prove that the 
area of A MON is greater than or equal the area of a AOB. 

(b) Let AD and AE be tangent to a circle, and let P be an arbitrary point 
on the minor arc. Let BPC be a tangent to the same circle. Show that 
the perimeter of A ABC is constant for all positions of P on the minor 
arc. 

(c) In the setup of part (b), let MN be any other line through P which 
intersects AE and AD in M and N respectively. Prove that the perime¬ 
ter of is ABC is smaller than the perimeter of A AMN. 

8 . 1 . 10 . A quadrilateral ABCD is inscribed in a circle (see Figure 8.11). Let 

* = BD, y = A C, and a, b, c, d be the lengths of the sides as indicated. 

Construct / CDE equal to CABD. L AO& 

(a) Prove that a CDE~ /\ADB arid hence that EC • x = ac. 

(b) Prove that i±ADE~~&BCD and hence AE ■ x = bd. 

(c) From parts (a) and (b), prove Ptolemy’s theorem (an important fact 
about cyclic quadrilaterals): In a cyclic quadrilateral the product of the 
diagonals is equal to the sum of products of the opposite sides. 

8 . 1 . 11 . 

(a) A line from one vertex of an equilateral triangle ABC meets the 
opposite side BC in a point P and the circumcircle in Q. Prove that 

J _L + _L 

PQ BQ CQ ' 

(b) Using the notation of part (a), prove that AQ 4 + BQ 4 + CQ 4 is con- 
sant for all positions of Q on the minor arc BC. (Hint: For a trigo¬ 
nometric approach, let x = AQ, v * BQ, z = CQ, and 8 = Z BAQ. 
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Show that x = (2/VT)sin $. z = (\/^)[cos8 — sin0], y = x + z. Also, 
see 8.4.6.) 

8 . 1 . 12 . In Figure 8.12 we are given an inscribed triangle ABC. Let R 
denote the circumradius; let h a denote the altitude AD. 

(a) Show that triangles ABD and ALC are similar, and hence that h a = 2 R 
= be. 

(b) Show that the area of &ABC is abc/AR. 

8 . 1 . 13 . The radius of the inscribed circle of a triangle is 4, and the 
segments into which one side is divided by the point of contact are 6 and 8. 
Determine the other two sides. 

8 . 1 . 14 . Triangles ABC and DEF are inscribed in the same circle. Prove 
that 


sin A + sin B + sin C = sin D + sin E + sin F 
if and only if the perimeters of the given triangles are equal. 

8 . 1 . 15 . In the following figure, CD is a half chord perpendicular to the 
diameter AB of the semicircle with center O. A circle with center P is 
inscribed as shown in Figure 8.13, touching AB at E and arc BD at F. 


D 



8.2. Analytic Geometry 


291 


Prove that A A ED is isosceles. (Hint: Label the figure and make good use 
of the Pythagorean theorem.) 

8 . 1 . 16 . Find the length of a side of an equilateral triangle in which the 
distances from its vertices to an interior point are 5, 7, and 8. 


Additional Examples 

1.2.1, 1.3.14, 1.4.2, 1.6.1, 1.6.10, 1.8.3, 1.8.7. 

8.2. Analytic Geometry 


The introduction of a coordinate system makes it possible to attack many 
geometry problems by way of algebra and analysis. 

8 . 2 . 1 . Let P be a point on an ellipse with foci F, and F 2 , and let d be the 
distance from the center of the ellipse to the line tangent to the ellipse at P 
(Figure 8.14). Prove that (PF^PF 2 )d 2 is constant as P moves on the 
ellipse. 


Solution. Place coordinates on the plane in such a way that the ellipse has 
the equation 

Y-2 Xp- 

~ + 0 < b < a. 

a b 



Figure 8.14. 
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The technique is straightforward: compute PF U PF 2 , and d (as functions of 
the x -coordinate of P), and check to see if the required product is a 
constant. 

Let the coordinate of P be (a, /?). The focal points F y and F 2 have 
coordinates (± c,0), where c 1 = a 2 — b 2 . Therefore we have 

PF } = yjfi 2 + (a + c) 2 , 


PF 2 = {p 2 + (a-cj 2 . 

To find d 2 , it is necessary to write the equation of the tangent to the 
ellipse at P(a, /?). To find the slope of the tangent at P, we compute the 
derivative: 


so that 




v . = ~ 2 x /a 2 = _ tfx 
2 y /b 2 aty ' 

It follows that the equation of the tangent at /*(«, fi) is 


or equivalently, 

a 2 fiy + b 2 ax = b 2 a 2 + a 2 $ 2 . 

But a 2 /a 2 + fi 2 /b 2 = 1, since P(a, j8) is a point on the ellipse, and 
therefore a 2 b 2 + a 2 /J 2 = a 2 b 2 . Hence the equation of the tangent at P(a, ft) 
is 


ab 2 x + /J a 2 y — a 2 b 2 = 0. 

Now recall the formula for the distance D from a point Q(c,d) to the 
line Ax + By + C = 0: 

|/4c + Brf+C| 

■JA 2 + B 2 

In our case, the distance d from the origin to the tangent line is 

a 2 b 2 


d = 


yja 2 b A + 0 2 a 4 


We now need to examine the product d 2 (PFiXPF 2 ). We can eliminate fi 
in each of these factors, since 


02 _ a 2 b 2 — a 2 b 2 
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We have 

d 2 = __ 

a 2 b 4 + (( a 2 b 2 - a 2 b 2 )/ a 2 )a 4 

= aV 

a 2 b 4 + a 4 b 2 — ot 2 a 2 b 2 

= a 4 b 4 _ 

b 2 [a 2 b 1 — a 2 a 2 ] + a 4 b 2 

= aV 
b\-c 2 a 2 ) + a 4 b 2 

_ a% 2 

a 4 - c 2 a 2 ’ 
and 


PF 2 = p 2 + (a + c) 1 

= aV^jxW +a 2 + 2ac + c 2 
a 2 

_ a 2 b 2 — a 2 b 2 + a 2 a 2 + 2a 2 ac + a 2 c 2 
a 2 

a 2 (b 2 + c 2 ) + a 2 (a 2 — A 2 ) + 2a 2 c 


_ a 4 + 2 a c« + c 2 a 


2„1 


Similarly, 


(a 2 + c«) 


PF 


2 - ~ C< *) 


Thus, 




= a 2 b 2 . 


This completes the proof. 


8,2,2. Suppose that (.v 2 ,.>’ 2 ), (* 3 , >’ 3 ) are three points on the 

parabola ^ 2 = ax which have the property that their normal lines intersect 
in a common point. Prove that _y t + y 2 + _y 3 = 0. 
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Figure 8.15. 


Solution. The solution falls out as a by-product of the following analysis. 
Let (a, (i) be the coordinates of the intersection of the three norma! lines 
(Figure 8.15), and let (a, y) be an arbitrary point on the parabola. The 
slope of the line through (a, y) and (a, j8) is (y — f3)/(x — a). The slope of 
the tangent to the parabola at (a, v) isy' = a/(2y), and therefore, the slope 
of the normal line at (x,y) is —2 y/a. It follows that (A,,y,), (*2,^2)’ 
(a 3 , _y 3 ) satisfy the equation 

y ~ 0 _ ( 2 y\ 

x - a V a )' 

Replacing a: by y 1 f a, this equation is 



a 2 (y — j8) = —ly? + 2 aay. 

Thusyi, y 2 ,^ 3 are the three roots of the cubic equation 
2y 3 + a(a - 2 )y — a 2 = 0. 

Now, remembering how the coefficients of a cubic equation are related to 
the roots (see Section 4.3), we see thaty, + _y 2 + y 3 = 0 (the coefficient of y 2 
is zero). 

8 . 23 . A straight line cuts the asymptotes of a hyperbola in points A and B 
and the curve in points P and Q. Prove that AP = BQ. 


Solution. We may assume the hyperbola and the straight line have the 
equations 


xy — \ 


(1) 
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respectively. (The hyperbola can be taken to have this form by appropriate 
scaling followed by a rotation, each of which takes straight lines into 
straight lines and preserves ratios of line segments.) 

The asymptotes of the hyperbola are the x and y axes (Figure 8.16); so 
let A be the x-intercept of the line, and let B be the ^-intercept. Let (x t , y ,) 
and (x 2 , y 2 ) be the coordinates of P and Q. Substituting y = 1 /x into (2) 
yields 

x 2 — ax + a/b = 0, 

and since x, and x 2 are roots of this equation, we know that 

x t + x 2 = a. 

Similarly, substituting x = \fy into (2) yields 

y 2 -by + b/a = 0, 

and this implies that 

Pi +P2 = b. 

It follows that 

AP 1 = (jc, - a) 2 + y i 

= (a-x 2 - af+(b-y 2 f 
= xl + {b-y 2 f 

= bq\ 

and the result follows. 


8.2.4. Determine all the straight lines lying in the surface z = xy. 
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Solution. The parametric equation for the line through (a,,a 2 ,a 3 ) with 
direction {d v d 2 ,d 3 ) ts given by 

x = a i + d x t, 
y - a 2 + d 2 t, 

z = a 3 + d 3 t. 

For such a line to lie in the surface z = xy it is necessary and sufficient that 
for all t , 

a 3 + d 3 t - (a, + d x t){a 2 +d 2 )t 


= a i a 2 + (a 2 d t + a } d 2 )t + d i d 2 t 2 . 

It follows that d t d 2 = 0, and d t and d 2 cannot both be zero, since this would 
imply that d, = d 2 = d 3 = 0, a contradiction. 

If d 2 = 0, then 

a 3 + d 3 t = a z (a, + d x t), 


or 


If d t = 0, then 


z = a 2 x. 


a 3 + d 3 t = a,(a 2 + d 2 t). 


or 

z = a t y. 

Thus, the only straight lines in the surface z = xy are of the form z = ax, 
y = a or of the form z = ay, x = a, where a is an arbitrary constant. 

8 . 2 , 5 . An equilateral triangle ABC is projected orthogonally from a given 
plane P to another plane P'. Show that the sum of the squares of the sides 
of the resulting triangle A'B’C' (Figure 8.17) is independent of the orienta¬ 
tion of the triangle ABC in P. 


Solution. First, some observations about how lengths are transformed under 
this projection. Suppose that AB is a line segment in P of length one, and 
that it makes an angle tp with the line L of intersection of P and P'. Let 8 
denote the angle between the planes. Locate C so that A ABC is a right 
triangle, and AC is parallel to L (see figure). Triangle ABC will project into 
a right triangle A'B'C'. Furthermore, AC and A'C' have the same length, 
and B'C' = BCcosff. Since AC = cos<f> and BC = sin <f>, it follows that 

A'B' = V(cos <f>) 2 + (sin^cosd) 2 . 

Now, let ABC denote an arbitrary equilateral triangle in P. We may 
suppose that the length of the side is one. Suppose that AB makes an angle 
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P 



<f> with L. Then BC and CA will make angles of <£> + j tt and + § w with L. 
Applying the result obtained above, we find that the sum of the squares of 
the sides of triangle A'B'C' is 

[(cos^t) 2 + (sin cos# ) z ] + J(cos(<f> + + (sin(<£ + ^w)cos#) 2 J 

+ ^(cos($ + fw)) 2 + (sin(^ + §ff)cos#) 2 j, 
which reduces to 

3 cos 2 # + | sin 2 #, 

which is independent of <(>. 


Problems 


8.2.6. Let the triangle ABC be inscribed in a circle, let P denote the 
centroid of the triangle, and let O denote the circumcenter. Suppose that 
A,B, C have coordinates (0,0), (a, 0), and (h,c) respectively. 

(a) Express the coordinates of P and O in terms of a,b,c. 

(b) Extend line segments AP, BP, and CP to meet the circle in points D, E, 
and F respectively. Show that 

AP . BP . CP _ -i 
PD PE PF 

(Hint: One way to proceed is the following: Let x denote OP, and let R 
denote the radius of the circumcircle. Then 

A_P . BP CP _ AP 2 + BP 2 + CP 2 

PD PE PF R 2 — x 1 
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Now express each of the terms on the right side in terms of a, b,c [using the 
results of part (a)].) 

8.2.7. Find the relation that must hold between the parameters a,b, c so 
that the line x/a + yjb = 1 will be tangent to the circle 

8.2.8. Equilateral triangles whose sides are 1,3,5,7, . .. are placed so that 
the bases lie comer to corner along the straight line. Show that the vertices 
lie on a parabola and are all at integral distances from its focus. 

8.2.9. 


(a) Tangents are drawn from two points (a,b) and (c,d) on the parabola 
y — x 2 . Find the coordinates of their intersection. 

(b) Two tangent lines, L, and L 2 , are drawn from a point T to a parabola; 
let P and Q denote the points of tangency of E, and E 2 respectively. 
Let E be any other tangent to the parabola, and suppose E intersects E, 
and L 2 at R and S respectively. Prove that 


TR . TS 
TP TQ 


= 1 . 


8.2.10. A parabola with equation y 2 = ax is cut in four points by the circle 
(x — hf + {y — k) 1 = r 1 . Determine the product of the distances of the 
four points of intersection from the axis of the parabola. 

8J.11. Let b and c be fixed real numbers, and let the ten points (j, yj), 

j = 1,2, ..., 10, lie on the parabola y = x 2 + bx + c. For j *= 1,2.9, 

let Ij be the point of intersection of the tangents to the given parabola at 
(j, Yj) and (j + 1, y J + [). Determine the polynomial function y = g(x) of 
least degree whose graph passes through all nine points 

8.2.12. Prove or disprove: there is at least one straight line normal to the 
graph of y = cosh x at a point (a, cosh a) and also normal to the graph of 
y = sinh x at a point (c, sinh c). 

8.2.13. 

(a) Show that the tangent lines to ellipse x 2 /a 2 + y 2 /b l = 1 have the form 

y = ax ± ( a 2 a 1 + b 2 ^' /2 , 

and vary in position with different values of a. (Because of the great 
utility of this form, particularly in problems of tangency which do not 
involve the consideration of the point of contact, this is called the 
magical equation of the tangent .) 

(b) Find the equation of the tangents to the ellipse 3x 2 +j 2 = 3 which 
have slope of one. 

(c) Find the area of the triangle formed by a tangent to the ellipse (say of 
slope m ) and the two coordinate axes. 
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8.2.14. 

(a) Let D be the disk x 2 + y 2 < 1. Let the point A have coordinates (r, 0), 
where 0 < r < 1. Describe the set of points P in D such that the open 
disk whose center is the midpoint of AP and whose radius is AP/2 is a 
subset of D. 

(b) Let D be the disk x 2 + y 2 < 1. Let points A and B be selected at 
random in D. Find the probability that the open disk whose center is 
the midpoint of AB and whose radius is AB/2 is a subset of D. 

8.2.15. Given an ellipse x 2 / a 2 + y 2 /b 2 =1, ay= b, find the equation of 
the set of all points from which there are two tangents to the ellipse whose 
slopes are reciprocals. 

8.2.16. If two chords of a conic are mutually bisecting, prove that the 
conic cannot be a parabola. 

8.2.17. Prove that the graph of a cubic equation is symmetric about its 
point of inflection. (Note: If the conic equation is/(x) = a.v 3 + bx 2 + c.v + 
d, the ^-coordinate of the inflection point is — b/3a.) 


Additional Examples 

1.3.11, 1.5.3, 1.5.8, 1.6.4, 3.1.4, 4.3.6, 4.3.7. 

8.3. Vector Geometry 


In this section we will think of vectors as quantities which have both 
magnitude and direction. Examples of vector quantities include force, 
velocity, and acceleration. We shall see that vectors can also be used 
advantageously in geometry problems. 

We will represent vectors by arrows (i.e., directed line segments) in the 
Euclidean plane. The direction of the arrow indicates the direction of the 
vector, and the length of the arrow indicates the magnitude of the vector. 

Two vectors are equal if they have the same length and the same 
direction. It is important to realize that two vectors may be equal without 
being col linear. 

_If P and Q are two points, the vector from P to Q will be denoted by 

PQ. The length, or magnitude, of PQ will be denoted by \PQ\. 

The sum, A + B, of vectors A and B is given by the parallelogram law 
(see Figure 8.18), or equivalently, by completing the triangle as in Figure 

8.19. The difference, A — B, of B from A , is shown geometrically in Figure 

8 . 20 . 
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A 

Figure 8.18. 



1 

Figure 8.19. 



B 

Figure 8.20. 


Place coordinates on the plane and denote the origin by 0. .Each point P 
in the plane determines a unique vector, OP, called the position vector of 
P\ we will often denote this vector simply by P (instead of OP). 

Suppose that P and Q are the position vectors of two points P and Q 
(Figure 8.21). Let R be a point on the directed line segment PQ which 
divides PQ in the ratio m : n. (Figure 8.22). Then the position vector of R is 

0 



Figure 8.22. 
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given by 


(m + n)P + m( Q - P) 
m + n 


It is instructive to think of /f in physical terms in the following way. 
Imagine a weightless bar PQ with a mass of n/(m + n) at P and a mass of 
m/(m + n) at Q. The center of mass of the resulting system will be at a 
point X on PQ where “the seesaw balances,” that is, at the point X where 

(—?—)PX = (-!?-)XQ. 

\m + n f \m + n/ ^ 

But this is the same as 

PX _ m 
XQ n ' 

Thus, X divides PQ into the ratio m : n; in other words, X = R = (nj{m + 
n))P + (m/(m + n))Q. The coefficients n/(m + n) and m/(m + n) can be 
thought of as “weighting factors.” Increasing the proportion of “weight” at 
P moves the point R toward P and decreases the ratio m : n, etc. 


83.1, In a triangle ABC the points D,E,F trisect the sides so that 
BC = 3 BD, CA = 3 CE, and AB = 3 AF (Figure 8.23). Show that triangle* 
ABC and DEF have the same centroid. 

Solution. We will first show that the position vector of the centroid of an 
arbitrary triangle PQR is given by + + To see this, remember 

that the centroid of triangle PQR is located at a point | of the way from P 
to the midpoint of QR. From the discussion preceding the problem, we 
know that the position vector for the midpoint of QR R, and 


C 
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therefore, the position vector of the centroid of PQR is + + i 5), 

and this is equal to + + as claimed. 

Because of the way D, E, and F are defined, we have 

D = \B + \C, 

E=\C + \A, 

F=±A + \B, 

and therefore 

centroid of A DEF = + + 

-i[j*+jc] + t[jc + ^] + j[H' + ii] 

= ±A + + 1C = centroid of A ABC. 

83.2. Prove that it is possible to construct a triangle with sides equal and 
parallel to the medians of a given triangle. 

Solution. Consider a triangle ABC, and let D , E, F be the midpoints of sides 
BC, AC, and AB respectively (see Figure 8.24). Then 

AD = AB + jBC, 

BE = BC + \CA, 

CF=CA + 1 AB. 

Adding these, we find that AD + BE + CD = (AB + 5(7 + CA) + + (BC + 
CA + AB) = 0 + (()-0 = 0. This implies that the vectors AD, BE, and CF 
form a triangle. But AD, BE, and CF are equal in magnitude and direction 
to the medians of triangle ABC. 

Before considering the next examples we will develop the following basic 
principle. Suppose that P, Q, and R are points which are not collinear 


c 
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Q 



(Figure 8.25). If aPQ + bPR — cPQ + dPR, then a = c and b = d. For if 
the condition holds and if a c, then 

and this implies that P, Q, R are collinear (vectors PQ and PR have the 
point P in common), which is a contradiction. Therefore a = c. In a similar 
manner b = d. 

833. Prove that the line joining one vertex of a parallelogram to the 
midpoint of an opposite side trisects a diagonal of the parallelogram. 


Solution. Label the parallelogram by A , C, D as shown in Figure 8.26, let 
F be the mi dpo int of DC, a nd le t E be the intersection of A F and BD. Note 
that AB = DC and AD = BC, because as vectors they have the same 
magnitude and the same direction. 

The point E is at the intersection of two lines. We can express this 
algebraically by saying that there exist constants a and b such that 

AE = aAF, 

AE=AB + bBD. 

Therefore, 


AB + bBD = aAF. 

The ide a is to express each of the vectors in this last equation in terms of 
AB and AD, and then we will make use of the principle discussed prior to 


D f c 



Figure 8.26, 
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the statement of the problem. Thus we have 

AB + b(AD -AB) = a(AD + {AB), 

(1 - b)AB + bAD = {aAB + aAD. 

It follows that 

I -b = {a, 
b = a. 

These equations imply that a = b = and the result follows. 

83.4. In triangle ABC (Figure 8.27), let D and E be the trisection points of 
sides BC with D between B and E, let F be the midpoint of side AC, and 
let G be the midpoint of side A B. Let H be the intersection of segments EG 
and DF. Find the ratio EH : HG. 

Solution. The plan is exactly as in the preceding problem. There are 
constants a and b such that 

AG + aGE = AF + bFD. 

Now express each of the vectors in terms of AB and AC: 

AG = {AB, 

GE = GB + BE = {AB + {BC 

= \AB + l(AC-AB)= -{AB + {AC, 

AF= {AC, 

FD = FA+AB + BD - {AC + AB + {BC 

- {AC+AB + {(AC-AB) - {AC + § AB. 

Substituting these into the previous equation, we have 

{AB + a[ - {AB + {AC] = {AC + b[ - {AC + f AB], 

- {a)AB + jaAC = \b~AB + (i - {b)AC . 
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It follows that 

ffl-i-i*. 

or equivalently, 

a + 4b = 3, 

4a + b ~ 3. 

The solution to this system is a = b = \, and it follows that EH: HG 
= 2:5. 

83.5. Given a triangle ABC, construct similar isosceles triangles ABC and 
ACB' outwards on the respective bases AB and AC, and BCA' inwards on 
the base BC (Figure 8.28). Show that AB'A’C' is a parallelogram. 

Solution. In vector language, our problem is to show that AB' + AC' — 
AA'. 

Let D, E, F be the midpoints of sides AB, BC, AC respectively. Then 
AB' = AF + FB' = \AC + FB’ 

AC' = AD + DC’ = FAB + DC' 

AA' = AB + BE + EA' 

= Jbf\{ac-ab) + ea' = {ab + \2c + ea'. 

To put FB', DC', and EA' into terms of AB and AC, we introduce the 
following notation (useful in other problems as well). Given points P and 
Q, let \PQ denote the vector obtained by rotating PQ, with unchanged 
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Figure 8.29. 


magnitude, through a right angle in the positive direction, as in Figure 8.29. 
Now, suppose the isosceles triangles erected on the sides of ABC have 
height-to-base ratio equal to k; that is, FB'/AC = DC' / AB = EA'/BC 
= k. Then, 


AB' = \AC + FB' = \AC + k\AC, 
AC' = \AB + DC' = \AB - k \AB, 
AA' = jAB + \AC + EA 
= \AB + {AC + k\BC 
= \ABA-\ACA-k\(AC-AB) 
= i AB + \AC + k\AC-k \AB. 


(Note that |(P + Q) = \ P + | Q, and |aP = a \ P for an arbitrary constant a.) 
These expressions for AB, AC', and A A' show that AB A-AC' = AA', and 
thus the solution is complete. 

Given vectors PQ and RS, the dot product PQ * RS is defined by the 
formula 


PQRS= |P0l|/lS|cos0, 

where 9 is the angle between the vectors, 0 < 9 < 180°. 
It can be shown that for arbitrary vectors A, B, C, 

A B= B A 

and 


A (B+C) = A B+A ■ C. 

Notice that if A and B are perpendicular, then A ■ B = 0. Conversely, if 
A • B = 0, then either ^ = 0 or B = 0, or A and B are perpendicular. Also, 
notice that A ■ A = \A\ 2 . 

83.6. In triangle ABC (Figure 8.30), AB = AC, D is the midpoint of BC, 
E is the foot of the perpendicular drawn D to AC, and F is the midpoint of 
DE. Prove that AF is perpendicular to BE. 
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Solution. This is the same problem as 1,5.3, but here we will give a proof 
using vector notation. We have 


AF - BE = (AE + EF) • (BD + DE) 


= AE - BD + EF - BD + EF - DE 
= {AD + DE)BD+EFBD+EFDE 


DE-BD + EF-BD + EF- DE 


= DE-DC- 


DE DC 
2 


DE-DE 

2 


DE-DC __ DE-PE 
2 2 


= DE- 




= DE-EC 
2 


= 0 . 


The concept of vector makes sense in Euclidean 3-space just as it does in 
the Euclidean plane. Just as in the case of the plane, vectors have length 
and direction and are represented as arrows or directed line segments (but 
now in 3-space). They are added by the parallelogram law, and can be 
manipulated to prove results in solid geometry. 


83.7. If two altitudes of a tetrahedron are coplanar, the edge joining the 
two vertices from which these altitudes issue is orthogonal to the opposite 
edge of the tetrahedron. 

Solution. Suppose that A P and BZ are altitudes from A and B respectively, 
and suppose they intersect in a point H (Figure 8.31). 
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Figure 8.31. 

_ A H is orthogonal to each of BC, CD, and BD, and BH is orthogonal to 

CD, AD, AC. We wish to show that AB is orthogonal to CD. For this, we 
compute the dot product: 

AB ■ CD = (HB - HA) CD=HBCD-HACD = 0-0 = 0. 

This completes the proof. 


8.3.8. Prove that if the opposite sides of a skew (nonplanar) quadrilateral 
have the same lengths, then the line joining the midpoints of the two 
diagonals is perpendicular to these diagonals. 


Solution. Let A, B, C, D denote the vertices of the quadrilateral, and let P 
and Q be the midpoints of A C and ^^respectively (Figure 8.32). We are 
given that \AD \ = \BC\ and \AB \ = \CD\. Squaring, and translating into 
dot-product language, we have 

AD • AD = BC ■ BC, 

AB AB = CD CD, 

or equivalently, 


(D-A)(D-A) = (C-B)(C-B), 
(B -A)(B~A) = (D-C)(D-C). 


C 



Figure 8.32. 
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We wish to prove that PQ is perpendicular to AC and BD; in vector 
language we wish to show that 

PQAC = 0, 

PQBD = 0, 

or equivalently, 

(e-p)-(c-i)-o, 

(e-F)-(o-i)*o. 

Substituting P = ±(A + C) and £> = + D), these equations yield 

(B + D - A - C ) • (C - A) = 0, 

V.. _ . \ J (2) 

(B + D-A ~C) (D-B) = 0. 

Our problem then is equivalent to showing that the equations (1) imply 
the equations (2). 

Expanding (I), we have 

D ■ D - 2A ■ D +A - A = C ■ C - 2B ■ C + B ■ B, 
BB-2AB+AA=DD-2CD+CC. 

Adding these, we get 

- 2A ■ D — 2A ■ B + 2A • A = 2C ■ C — 2B • C — 2C • D, 
-(B+C)A+AA = CC~C(B+D), 

(B + D) (C-i)- (C-C-A-A) = 0, 

(B + D) • (C — A) ~ (C + A) ■ (C - A) ~ 0, 

(B + D - C - A) ■ (C - A) = 0. 

This is the first of the two equations in (2). To get the second of the 
equations in (2), take the difference of the equations (I). The details are just 
as in the previous computation. 

In a similar way, adding and subtracting the equations in (2) yield the 
equations in (1), which means that the converse theorem is also true: 
namely, if the line joining the midpoints of the two diagonals of a skew 
quadrilateral is perpendicular to these diagonals, then the opposite sides of 
the quadrilateral are of equal length. 


Problems 


83.9. In a triangle ABC the points D, E, and F trisect the sides so that 
BC = 3 BD, CA = 3 CE, and AB = 5AF. Similarly, the points G, H, and I 
trisect the sides of triangle DEF so that EF = 3 EG, FD = $FH, and 
DE = 3DI. Prove that the sides of A GHI are parallel to the sides of 
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A ABC and that each side of the smaller triangle is j as long as its parallel 
side in the larger triangle. 

83.10. The sides of AD,AB,CB,CD of the quadrilateral A BCD are di¬ 
vided by the points E,F,G,H so that AE: ED = AF : FB = CG: GB 
= CH : HD. Prove that EFGH is a parallelogram. 

83.11. 


(a) In triangle ABC (Figure 8.33), points D and E divide sides BC and AC 
in such a way that BD/DC = 3 and AE/EC = §. Let P denote the 
intersection of AD and BE. Find the ratio BP : PE. 

(b) In triangle AB (Figure 8.34), points E and F divide sides AC and AB 
respectively so that AE/EC = 4 and AF/FB = 1. Suppose D is a point 
on side BC, let G be the intersection of EF and AD, and suppose D is 
situated so that AG/GD = \. Find the ratio BD/DC. 

83.12. On the sides of an arbitrary parallelogram ABCD , squares are 
constructed lying exterior to it. Prove that their centers M x , M lt A/ 3 , M 4 are 
themselves the vertices of a square. 

83.13. On the sides of an arbitrary convex quadrilateral ABCD, equilat¬ 
eral triangles ABM t , BCM Z , CDM 3 , and DAM 4 are constructed so that the 
first and third of them are exterior to the quadrilateral, while the second 
and fourth are on the same side of sides BC and DA as in the quadrilateral 
itself. Prove that the quadrilateral M 1 M 2 M 3 M 4 is a parallelogram, 

83.14. On the sides of an arbitrary convex quadrilateral ABCD, squares 
are constructed, all lying external to the quadrilateral, with centers M u M 2 , 



Figure 8.34. 
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A/ 3 , A/ 4 . Show that A/jA /3 = A/ 2 A/ 4 and that is perpendicular to 

M 2 M A . 

83.15. Isosceles triangles BCX, CA Y, and ABZ are constructed externally 
on the sides of a triangle ABC. Show that the centroids of &ABC and 
A XYZ coincide. 

83.16. The altitudes of a triangle ABC are extended externally to points 
A\ B‘ and C' respectively, where AA'=k/h a , BB'=k/h b , and CC" 
= k/h ( . Here, k is a constant and h a denotes the length of the altitude of 
ABC from vertex A, etc. Prove that the centroid of the triangle A’B’C' 
coincides with the centroid of ABC. 

83.17. Let A BC be an acute angled triangle. Construct squares externally 
on the three sides. Extend the altitudes from the three vertices until they 
meet the far sides of the squares on the opposite sides. Then the squares are 
cut into two rectangles. Prove that “adjacent rectangles” from different 
squares are equal in area. That is, prove that area / = area/' for / = 1,2,3 
(see Figure 8.35). (Use the dot product to give a one-line proof.) What 
happens as ABC becomes a right triangle? 

83.18. In a tetrahedron, two pairs of opposite edges are orthogonal. Prove 
that the third pair of opposite edges must also be orthogonal. 

83.19. Let O be a given point, let P u P 2 , be vertices of a regular 

n-gon, n > 7, and let Q t , Q 2 , .. ., Q„ be given by 

OQ, = OP> + , i = 12, ,..,n 

(P n + i = P|, P„ + 2 = Pi)- Prove that Q t , Q 2 , . . . , Q„ are vertices of a regu¬ 
lar n-gon. 



Figure 8.35. 
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8.4. Complex Numbers in Geometry 


In this section we will build on the geometry of complex numbers intro¬ 
duced in Section 3.5. 

8.4.1. A h A 2 , . ■ . , A n is a regular polygon inscribed in a circle of radius r 
and center O. P is a point on OA , extended beyond A v Show that 

n 

II PA k = OP n ~ r". 

*-1 


Solution. Consider Figure 8.36 as representing the complex plane with the 
center of the circle at the origin, and with the vertices A i at the n roots of 
z" — r n = 0. Specifically, we set the affix of A k to be z k = re 2v(k ~ (The 
affix of a point Q in the plane is the complex number which corresponds to 
Q.) With these coordinates, P corresponds to a real number, which we 
will denote by z. Then 

II PA k= f[ I* - z k\ 

jt — 1 

= n ( z - z k) 

k-\ 

= | z* - r"\ 

= z" — r” (z and r are real) 

= OP” - r*. 


8.4.2. Given a point P on the circumference of a unit circle and the 
vertices A ,,A 2 , ■ .., A„ of an inscribed regular polygon of n sides, prove 
that PA] + PA\ +•••-!- PAl is a constant. 



Figure 8.36. 
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Solution. Again, let A h A 2 . A n correspond to the n roots of unity; 

specifically, let the affix of A k be z k = e 2vk ^ n t k = 1,2, .. ., n. Let the affix 
of P be z. Then 

2 2 I*-**! 2 

*=i *—i 

= 2 (z - tk)(z ~ 4) ( w2 = ww) 

n 

= 2 {^-z h z-zz k + z k z k ) 

- 2 «- ( 2 t*) 1 ~ z i 2 + 2 v* ■ 

But = i z k = 0, since the z k 's are the roots of z n - 1 = 0 and the coeffi¬ 

cient of z" _1 is zero. Therefore, 

n n n 

2 = 2 ZZ + 2 z k^k 

X — l *-l k= I 

= 2 \ z \ 2+ 2 \ z k\ 2 

*—i *-i 

— n + n (|z| = 1 and \z k \ = |) 

= 2 n. 


8.43. Prove that if the points in the complex plane corresponding to two 
distinct complex numbers z, and z 2 are two vertices of an equilateral 
triangle, then the third vertex corresponds to — wz, — to 2 z 2 , where lo is an 
imaginary cube root of unity. 


Solution. Points Z|,z 2 ,z 3 form an equilateral triangle if and only if z 3 — z, 
= ( z 2 — z l )e ±m/3 . Thus, given z l and z 2 , z 3 must have the form 

z 3 = (l - e ± ” /3 )z l + e ± ^ / h 1 

= - [ - 1 + e ± * i/3 ]z 1 - [ - e-^ 3 ]z 2 . 

We can see from the geometrical interpretation of these quantities (Figure 
8.37) that - 1 + e ±m ^ and — e~ m ^ are the imaginary cube roots of unity. 
Alternatively, we can verify this algebraically: 

- 1 -I- e ±m ^ = - 1 + cos(± in-) + i sin(± iff) 

= -1 +{±^ 
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and 


_ „ ±m/3 s- _ 


[ cos( ± ^ jr) + i sin( ± j w) J 


— — \ + \ i. 

Conversely, suppose z 3 = — wz, — to 2 z 2 , where lo is an imaginary cube 
root of unity. Then 

u=-l + e m/3 and « 2 =-e m/3 
or 

w = - I + and ta 2 = — e~ v,/3 , 

and the previous arguments show that z l5 z 2 ,z 3 form an equilateral triangle. 


8.4.4. Equilateral triangles are erected externally on the sides of an arbi¬ 
trary triangle ABC. Prove that the centers (centroids) of these three equilat¬ 
eral triangles form an equilateral triangle. 


Solution. Let a,b,c be the affixes of A,B,C respectively (in the complex 
plane), with x, y,z the affixes of the centers of the equilateral triangles as 
shown in Figure 8.38. Let w = e 2ir '^. Then u> 2 + w + 1 = 0 (w is a cube 

roots of unity, so 0 = w 3 - 1 = (« - l)(w 2 + w + 1)). Also, note that e 17 ^ 3 

= -w 2 ande -m / 3 = — w. 

The centroid of a ABC has affix j(u + b + c). In a similar way, x, y, 
and z are given by 

= — w 2 (c — a ) ] ] = i [(2 4- « 2 )a + (I — w 2 )c], 

y = 4[a + 64-[a- w(6 — a)]] = ^[(2 + w)n + (1 — w)£>], 

z = j[fr + c + [i — w(c — A)] ] = ±[(2 4- u)b -I- (1 — w)c]. 

To show that x, y,z forms an equilateral triangle, it suffices to show that 

z - x = -<z\y - jc). 
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We have 

3(z - x) = - (2 4- w 2 )a + (2 + + (— w + u 2 )c 

— 3to 2 (/ — jc) = 3w 2 (jc ~ y) — (co 4 — tt) 3 )a — (w 2 — w 3 )f> + (w 2 — co 4 )c. 
But 

cd 4 — co 2 = w — 1 = ( - 1 — w 2 ) - 1 = - (2 4- to 2 ), 

(to 3 — co 2 ) = — <j 2 4- I = (1 4* to) 4- 1 = 2 + to, 
co 2 — to 4 = co 2 — to, 

and therefore, the coefficients of a, b, c in the above expressions for z - x 
and — u 2 (y - x) are equal. It follows that a\ y, z forms an equilateral 
triangle. 

Problems 

8.4.5. Let A 0 ,A l ,A 2 ,A 3 ,A 4 divide a unit circle (circle of radius 1) into five 
equal parts. Prove that the chords A 0 A t , A 0 A 2 satisfy 

(A 0 A, ■ A 0 A 2 f = 5. 

8.4.6. Given a point P on the circumference of a unit circle and the 
vertices A h A 2 , . , . ,A n of an inscribed regular polygon of « sides, prove 
that PA* + PA 2 +-••-(- PA* is a constant (i.e., independent of the posi¬ 
tion of P on the circumference). 

8.4.7. Let G denote the centroid of triangle ABC. Prove that 

3 (GA 2 + GB 2 + GC 2 ) + AB 2 + BC 2 + CA 2 . 
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8.4.8. Let ABCDEF be a hexagon in a circle of radius r. Show that if 
AB = CD — EF=r, then the midpoints of BC, OE, and FA are the 
vertices of an equilateral triangle. 

8.4.9. If Zj, z 2 , z 3 are such that |r,| = |z 2 | = |z 2 | = I and z, + z 2 + z 3 = 0, 
show that z b z 2 , z 3 are the vertices of an equilateral triangle inscribed in a 
unit circle. 

8.4.10. Show that z u z 2 , z 3 form an equilateral triangle if and only if 

z l + z 2 + z \ = z l z 2 + Z 2 Z 3 + Z 3 Z 1' 

8.4.11. The three points in the complex plane which correspond to the 
roots of the equation 

z 3 — 3 pz 2 4- 3 qz - r = 0 
are the vertices of a triangle. 

(a) Prove that the centroid of the triangle is the point corresponding to p. 

(b) Prove that ABC is an equilateral triangle if and only if p 2 = q. 



Glossary of Symbols and Definitions 


Centroid (of a 
triangle) 

Circumcenter (of 
a triangle) 

Convex hull 

Convex set 
Fibonacci sequence 

Function 

even 

odd 

convex 


The point where the medians of a triangle inter¬ 
sect. (A median of a triangle is a line joining a 
vertex to the midpoint of the opposite side.) 

The center of the circumscribed circle (the circle 
passing through the three vertices of the trian¬ 
gle). The point where the perpendicular bisec¬ 
tors of the sides of the triangle intersect. 

The smallest convex set which contains all the 
points of the set. 

A set that contains the line segment joining any 
two of its points. 

The sequence of numbers defined as F x * 1, 
F 2 = 1, and F„ = F„~ l 4- F n _ 2 for n > 2. The 
sequence begins 1, 1, 2, 3, 5, 8, 13, 21, 34, 
55. 

A function f with the property that /(- x) «= f(x) 
for all x. 

A function / with the property that /( — x) — 
- /(x) for all x. 

A real-valued function defined in the interval 
(a, b) such that for each x, y, z with a < x < y 
< z <b, /(x) < £(x), where £(x) is the linear 
function coinciding with /(x) at x and z. 
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concave 

n 

Incenter (of a 
triangle) 

Lattice point 

Orthocenter (of 
a triangle) 

Pascal’s triangle 

Pythagorean triple 

Set 

S-T 

fc-subset 

Triangular numbers 


A function which is the negative of a convex 
function. 

The greatest integer function; for each real num¬ 
ber x, M is the largest integer less than or 
equal to x. 

The center of the incircle of a triangle. The in¬ 
circle, or inscribed circle, is the circle tangent to 
the sides of the triangle. The incenter is the 
point where the bisectors of the angles of the 
triangle intersect, 

A point in the Euclidean plane (or R") whose 
coordinates are integers. 

The point of intersection of the three altitudes of a 
triangle. 

A triangular array of numbers whose nth row 
(n = 0,1,2, .. ,) is composed of the coefficients 
of the expansion of (a + b) n . 

A set of three integers which satisfy the equation 
x 2 +y 2 = z 2 . 


The subset consisting of those elements in the set 
S that are not in the set T. 

A subset of k elements. 

The numbers in the sequence 1,3,6,10,... 
whose nth term is n(n + ])/2. 
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